FOREWORD


This  Manual  of  statistical  procedures  is  intended  for  use  by  scientists 
and  engineers.  However,  especially  in  the  planning  stages  of  tests  and 
experiments,  it  is  advisable  to  supplement  the  information  by  occasional 
consultations  with  statisticians. 

The  proper  use  of  modern  statistical  methods  provides  not  merely  a 
precise  summary  of  the  conclusions  that  may  be  drawn  from  an  experiment 
already  performed  but  also,  with  a  small  amount  of  prior  information  or 
by  making  reasonable  assumptions,  a  reliable  prediction  of  the  information 
that  can  be  gained  from  a  proposed  experiment.  The  latter  feature  has  to 
some  extent  already  placed  the  planning  of  tests  and  experiments  on  a 
substantially  more  rational  basis,  and  further  progress  can  be  expected 
in  the  future. 

Although  the  Manual  is  not  intended  to  be  a  textbook  or  treatise  on  the 
one  hand,  or  merely  a  set  of  tables  on  the  other,  it  is  rendered  somewhat 
self-contained  by  the  inclusion  of  Chapter  1,  Definitions  and  Distributions, 
and the Appendix tables and charts. Attention is called particularly to
Appendix Table 8, which enables confidence limits for a standard deviation
to be obtained immediately, and to Charts II-X, which facilitate the
determination of the sample sizes required for various experiments.

The preparation of the Manual was first undertaken at the Naval Ordnance
Test Station in 1948, under Task Assignment NOTS-36-Re3d-439-3,
as authorized by Bureau of Ordnance letter NP36(Re3d) AAF:bc dated
26 October 1948, which also provided for the basic statistical study of the
programs of the Station. Work on the manuscript was not completed at
the termination of this task assignment, and the continuation of its preparation
from 1950 to 1955 was supported by funds from exploratory and
foundational research and general overhead. Although the book was
prepared  primarily  for  use  at  the  Station,  it  may  well  be  of  value  at  other 
Department  of  Defense  establishments. 

The Manual was reviewed for technical accuracy in three sections:
Chapters 1, 2, and 3 were reviewed by R. M. McClung, A. W. Nelson,
and Nancy L. Seeley; Chapters 4, 7, and 8, by D. E. Hartvigsen, Paul
Peach, and R. W. Reynolds; and Chapters 5 and 6, by J. R. Harvey, J. P.
Vanderbeck, D. S. Villars, and J. E. Walsh. In addition, the Manual was
reviewed as a whole by J. R. Harvey, R. M. McClung, H. A. Meneghelli,
Paul Peach, and J. E. Walsh.

Comments  and  corrections  will  be  appreciated.  They  should  be  directed 
to  Commander,  U.  S.  Naval  Ordnance  Test  Station  (Code  507),  China 
Lake,  California. 


D.  B.  YOUNG,  CAPT.,  USN 
Commander 


WM.  B.  McLEAN 
Technical  Director 


INTRODUCTION 


The  Manual  has  been  prepared  for  those  engineers  and  scientists  who  wish 
to  use  statistical  procedures  to  improve  the  efficiency  of  their  experiments. 
Emphasis is placed on (1) consulting with a statistician while an experiment
is in the planning stage, (2) specifying what a decision in a statistical
test  means  by  stating  the  risks  involved,  and  (3)  attaching  confidence 
limits  to  an  estimate.  The  reader  who  frequently  needs  to  use  statistical 
methods  should  find  a  university  course  helpful.  A  group  of  engineers  and 
scientists  who  desire  specific  instruction  might  profitably  organize  a  seminar 
under  the  leadership  of  a  statistician. 

Users  of  the  Manual  may  find  the  following  procedure  useful: 

1.  Read  Chapter  1  for  basic  definitions.  Since  this  is  a  manual  rather 
than  a  textbook,  the  explanatory  material  is  brief.  References  1.3 
and  1.5  are  suggested  for  fuller  explanations  of  points  that  seem 
difficult.1 

2.  Study  the  Table  of  Contents  so  that  the  appropriate  part  of  the  book 
can  be  found  readily  when  a  problem  arises.  Specific  items  are  listed 
in  the  Index. 

3. Consult with a statistician while the experiment is being designed. He
can  indicate  the  sections  in  the  Manual  that  describe  the  proper 
technique  to  be  used. 

4.  Read  the  general  explanation  of  the  technique  to  be  used  and  note 
the  formulas  and  relevant  Appendix  tables  or  charts.  Substitute  the 
pertinent  data  in  the  general  formulas,  using  a  worked  example  as 
a  guide. 

5. For fuller treatments, refer to the various texts and papers listed in
the Bibliography at the end of each chapter. Derivations and proofs
have been omitted in an effort to make the book useful to a reader
with only a general mathematical background. (The occasional
occurrence of an integral sign should not deter the experimenter who
is rusty in mathematics. The sign is merely a convenient symbol for
the area under a curve, and the techniques of calculus are not needed.)

1 Reference numbers refer to the publications listed in the Bibliography section of
each chapter. Thus, Ref. 1.5 is the fifth bibliographical reference at the end of
Chapter 1. All publications mentioned are available in the Technical Library at the
Naval Ordnance Test Station.

The  application  of  statistics  in  experimentation  may  be  divided  into  three 
main  steps,  as  follows: 

1.  Design  of  the  experiment.  Conclusive  results  from  an  experiment 
depend  upon  an  efficient  plan.  Properly  designing  the  experiment  pays 
dividends  in  time  and  money  by  decreasing  the  number  of  computations 
and  increasing  the  amount  of  information  obtainable.  Planning  includes 
the specification of a mathematical statistical model for the phenomena
being investigated. It is helpful, but not always possible, to specify the
form of the equations; the unknown element then consists of a set of
constants called "parameters" (denoted by Greek letters, such as μ and σ).

In  nearly  all  of  the  methods  in  the  Manual,  it  is  assumed  that  it  is  most 
practical  to  take  a  prescribed  number  of  observations.  However,  if  the 
results  of  each  trial  or  group  of  trials  can  be  made  available  without  delay, 
it  is  economical  to  judge  the  accumulated  data  at  each  step  by  the  method 
of "sequential analysis." Two important types of sequential analysis are
given  in  Sec.  8.4,  and  many  other  sequential  methods  are  available  in  the 
references. 

2.  Reduction  of  data — estimation.  By  following  the  appropriate  example 
in  the  Manual,  experimenters  will  be  able  to  compute  characteristics  that 
summarize their data. Such a characteristic of the sample is called a
"statistic" and is denoted by an italic letter such as x̄ or s. Those statistics
which are the best possible estimates (evaluations) of the unknown parameters
are of particular interest. Experimenters sometimes use rough methods
of data reduction because the best methods are not easily available, but the
use of such methods is often equivalent to throwing away a large proportion
of the observations. This Manual is intended to make the best methods
easily available.

3. Examination of significance. To an extent that depends upon the
original experimental design, it will be possible to test hypotheses about
the "population" on the basis of the experimental "sample" from that population,
or to state the limits of precision in the estimation of parameters.
If reasonable care is taken, and if the risks of error are included, these
conclusions can be stated rigorously.


STATISTICS MANUAL

With  Examples  Taken  From 
Ordnance  Development 


Chapter  1 

DEFINITIONS  AND  DISTRIBUTIONS 

1.1.  Definitions 

1.1.1.  Population  and  Sample 

The  population  is  the  whole  class  about  which  conclusions  are  to  be 
drawn;  for  instance,  a  production  lot  of  propellant  grains,  rather  than 
100  grains  selected  as  a  sample  from  that  lot.  Frequently  it  is  impossible, 
or  at  least  impractical,  to  take  measurements  on  the  whole  population.  In 
such a case, measurements are taken on a sample drawn from the population,
and the findings from the sample are generalized to obtain conclusions
about  the  whole  population.  For  example,  if  the  population  is  made  up 
of  the  time  to  target  of  every  rocket  of  a  certain  design,  the  population 
may  be  so  large  as  to  be  considered  infinite.  By  launching  only  a  sample 
of  the  rockets,  instead  of  expending  every  rocket  made,  information  can 
still  be  gained  about  the  time  to  target  of  all  rockets  of  the  given  design. 

To  permit  good  generalization  to  the  population,  a  random  sample  is 
selected;  that  is,  the  sample  is  chosen  in  such  a  manner  that  every  individual 
in  the  population  has  an  equal  chance  of  being  chosen  for  the  sample. 
For  instance,  if  there  are  1,000  rockets  in  storage  and  we  want  to  estimate 
time  to  target  by  firing  50  of  them,  the  50  should  be  chosen  by  lot  from 
among  the  1,000,  as  in  a  raffle,  or  by  using  a  table  of  random  numbers 
(Ref.  1.7)  so  that  each  has  an  equal  chance  of  being  one  of  the  50  fired. 
In  many  cases  where  a  true  random  selection  is  not  feasible,  an  adequate 
substitute  can  be  found  (Ref.  1.6,  Chap.  8).  For  a  test  of  randomness,  see 
Sec.  4.1  of  this  Manual. 
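
The selection by lot described above can be imitated with a pseudorandom generator. The following is a minimal sketch, assuming the 1,000 rockets carry serial numbers 1 to 1,000; the function name `choose_by_lot` and the seed value are illustrative and do not come from the Manual.

```python
import random


def choose_by_lot(population_size, sample_size, seed=None):
    """Draw serial numbers without replacement, each item equally likely."""
    rng = random.Random(seed)
    serials = range(1, population_size + 1)
    return sorted(rng.sample(serials, sample_size))


# Hypothetical use: pick 50 of the 1,000 stored rockets for firing.
rockets_to_fire = choose_by_lot(1000, 50, seed=1955)
print(rockets_to_fire)
```

Fixing the seed only makes the draw repeatable for the record; leaving it unset gives a fresh selection each time.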

If two separate random samples are drawn, the two samples are independent
if we select the second sample by lot with no reference to the make-up
of the first sample. Thus, in comparing the performance of a rocket of
type A with that of a rocket of type B on the basis of a sample of 10 of
each type, we should not let the choice of the 10 type-A rockets influence
the choice of the 10 type-B rockets in any way. (Sometimes the time at
which  each  observation  is  made  exerts  such  an  influence.)  We  speak  of 
two  variables  as  being  independent  when  fixing  the  value  of  one  has  no 
effect  on  the  relative  frequency  with  which  each  value  of  the  other  is  to  be 
expected.  For  instance,  the  horizontal  deflection  and  the  vertical  deflection 
of  shots  at  a  target  are  independent  if  fixing  the  horizontal  deflection  has 
no  effect  on  the  probability  of  obtaining  each  vertical  deflection.  Also,  in 
the  case  of  an  infinite  population,  the  observations  in  a  random  sample  are 
all  independent  of  one  another. 

When  two  events  are  independent,  the  probability  that  they  both  happen 
can be found by multiplying the probability of the first event by the probability
of the second event. ("Probability" is defined in Sec. 1.1.2.) For
instance, if the horizontal deflection of a rocket has a probability 0.7 of
being less than 10 mils and is independent of the vertical deflection, which
has a probability 0.3 of being less than 10 mils, the probability that the
rocket will land in a position such that both its horizontal and vertical
deflections are less than 10 mils is 0.7 × 0.3 = 0.21.
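
The multiplication rule for independent events can be checked by simulation. The sketch below assumes only the two probabilities quoted in the text (0.7 and 0.3); the trial count and seed are arbitrary choices made for illustration.

```python
import random

P_HORIZONTAL = 0.7  # P(horizontal deflection < 10 mils), from the text
P_VERTICAL = 0.3    # P(vertical deflection < 10 mils), from the text

# Product rule for independent events.
p_both = P_HORIZONTAL * P_VERTICAL

# Simulated check: draw the two events independently many times.
rng = random.Random(0)
trials = 100_000
hits = 0
for _ in range(trials):
    horizontal_ok = rng.random() < P_HORIZONTAL
    vertical_ok = rng.random() < P_VERTICAL
    if horizontal_ok and vertical_ok:
        hits += 1
estimate = hits / trials

print(f"product rule: {p_both:.2f}, simulated: {estimate:.3f}")
```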

1.1.2.  Distribution 

To  get  a  good  picture  of  the  sample  as  a  whole  when  the  sample  is  fairly 
large,  it  is  desirable  to  construct  a  frequency  diagram  which  will  show 
the  number  of  observations  that  occur  within  a  given  interval. 

In Table 1.1, 50 measurements of range of a particular type of rocket
are listed; in Table 1.2, the frequency distribution of these measurements
is given. Most of the information is preserved if the data are grouped into
intervals of 100 yards, and it is customary to begin the intervals at 999.5,
1,099.5, 1,199.5, etc., in order to show clearly the group in which each
measurement belongs. Since the import of the data is more quickly grasped
if the frequency distribution is presented in graphical form, a diagram of
the data is made, as shown in Fig. 1.1. Such an arrangement can easily be
made with a typewriter.


TABLE 1.1. Measurements (in Yards) of Range of
a Type of Rocket

1,472  1,315  1,984  1,900  1,646  1,709  1,780  1,571  1,681  1,453
1,799  1,372  1,420  1,612  1,546  1,255  1,818  1,357  1,412  1,775
1,850  1,417  2,057  1,468  1,694  1,776  1,456  1,489  1,618  1,544
1,251  1,422  1,506  1,651  1,687  1,080  1,866  1,713  1,624  1,370
2,107  1,668  1,411  1,654  1,503  1,934  1,451  1,240  1,500  1,606


TABLE 1.2. Frequency Distribution of
Data in Table 1.1

Range, yd            Frequency (No. of rounds)
  999.5-1,099.5       1
1,099.5-1,199.5       0
1,199.5-1,299.5       3
1,299.5-1,399.5       4
1,399.5-1,499.5      11
1,499.5-1,599.5       6
1,599.5-1,699.5      11
1,699.5-1,799.5       6
1,799.5-1,899.5       3
1,899.5-1,999.5       3
1,999.5-2,099.5       1
2,099.5-2,199.5       1
Total                50
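
The grouping into 100-yard classes and the typewriter-style diagram can be reproduced mechanically. The sketch below is one possible arrangement: the readings are transcribed from Table 1.1, while the names `RANGES_YD` and `frequency_table` are illustrative and not from the Manual.

```python
# Readings transcribed from Table 1.1 (yards).
RANGES_YD = [
    1472, 1315, 1984, 1900, 1646, 1709, 1780, 1571, 1681, 1453,
    1799, 1372, 1420, 1612, 1546, 1255, 1818, 1357, 1412, 1775,
    1850, 1417, 2057, 1468, 1694, 1776, 1456, 1489, 1618, 1544,
    1251, 1422, 1506, 1651, 1687, 1080, 1866, 1713, 1624, 1370,
    2107, 1668, 1411, 1654, 1503, 1934, 1451, 1240, 1500, 1606,
]


def frequency_table(data, start=999.5, width=100.0, n_classes=12):
    """Count the readings falling in each class [start + k*width, start + (k+1)*width)."""
    counts = [0] * n_classes
    for x in data:
        k = int((x - start) // width)  # index of the class containing x
        counts[k] += 1
    return counts


counts = frequency_table(RANGES_YD)

# One row of X's per class, in the manner of a typewriter diagram.
chart_lines = []
for k, count in enumerate(counts):
    lower = 999.5 + 100.0 * k
    chart_lines.append(f"{lower:8,.1f}-{lower + 100.0:8,.1f}  {count:2d}  {'X' * count}")
print("\n".join(chart_lines))
```

Starting the classes at 999.5 means that no whole-yard reading can fall on a class boundary, which is the point of the convention described above.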

It is often of more interest to know the relative frequency or percentage
frequency (100 times the relative frequency) of observations
falling in a given interval than the actual number. In Fig. 1.2, which is a
histogram or bar diagram, the percentage frequencies as well as the
actual frequencies are given. (If there were more sample members and
shorter intervals, the outline of the histogram would probably show a
smoother curve.)

RANGE, YD        FREQUENCY
                 0    5    10   15
1,000-1,100      X
1,100-1,200
1,200-1,300      XXX
1,300-1,400      XXXX
1,400-1,500      XXXXXXXXXXX
1,500-1,600      XXXXXX
1,600-1,700      XXXXXXXXXXX
1,700-1,800      XXXXXX
1,800-1,900      XXX
1,900-2,000      XXX
2,000-2,100      X
2,100-2,200      X

FIG. 1.1. A Graphical Presentation of a Frequency Distribution. (Precise intervals
begin at 999.5, 1,099.5, etc.)


In dealing with a population, rather than a sample, we use the
theoretical equivalent of the relative frequency; that is, the probability
of observations falling in a given interval or, more generally, the probability
of any event out of a given set of events. Since the probability of
an event E, denoted by P(E), is essentially defined as the relative frequency
of occurrence of E in a long series of trials, any probability lies in the range
from 0 to 1. Probabilities can be operated on according to two basic laws.
For any two events, denoted by A and B:

(1) P(A and B) = P(A) × P(B if A has occurred)
               = P(B) × P(A if B has occurred)

(1a) If A and B are independent events

P(A and B) = P(A) × P(B)

(Examples at end of Sec. 1.1.1 and 1.2.1.)

(2) P(A or B or both) = P(A) + P(B) - P(A and B)

(2a) If A and B are mutually exclusive events

P(A or B or both) = P(A or B) = P(A) + P(B)

These  can  be  extended  to  any  number  of  events  (Ref.  1.2). 
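
The two laws can be verified by direct enumeration on a small example. The sketch below uses one throw of a fair six-sided die, which is an assumption made purely for illustration (the Manual gives no such example); event A is "even number" and event B is "greater than 3".

```python
from fractions import Fraction

OUTCOMES = range(1, 7)  # faces of a fair die (hypothetical example)


def prob(event):
    """Probability of an event as the fraction of equally likely outcomes satisfying it."""
    return Fraction(sum(1 for o in OUTCOMES if event(o)), len(OUTCOMES))


def is_even(o):          # event A
    return o % 2 == 0


def is_above_three(o):   # event B
    return o > 3


p_a = prob(is_even)
p_b = prob(is_above_three)
p_a_and_b = prob(lambda o: is_even(o) and is_above_three(o))
p_a_or_b = prob(lambda o: is_even(o) or is_above_three(o))

# "P(B if A has occurred)": restrict attention to the outcomes in A.
p_b_given_a = p_a_and_b / p_a

print("Law (1):", p_a_and_b, "=", p_a * p_b_given_a)
print("Law (2):", p_a_or_b, "=", p_a + p_b - p_a_and_b)
```

Here A and B are not independent, since P(A and B) = 1/3 differs from P(A) × P(B) = 1/4, so the general form (1) is needed rather than (1a).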

As an analog of the sample frequency distribution, we introduce the
population frequency distribution or probability distribution of a
variable x, often called merely the distribution of x. (In the example
given at the beginning of this section, x would be the range of the rocket.)
The distribution of x shows the relative frequencies with which all possible
values of x occur. A population distribution can be pictured in terms of a
curve f(x), which is well approximated by a relative-frequency histogram,
made with rectangles having unit width, for a large sample from the
population.


FIG. 1.2. A Histogram.


To see the precise meaning of the curve f(x), let ΔF be the probability
that an observation will fall in the interval of length Δx extending from
x to x + Δx. Then ΔF/Δx can by physical analogy be called the average
probability density in the interval. If we let Δx approach zero, the average
density can be considered as approaching a limiting value f(x), which we
call the probability density at x. This is represented in symbols by the
equation

$$f(x) = \lim_{\Delta x \to 0} \frac{\Delta F}{\Delta x} = \frac{dF}{dx}$$

We therefore call f(x) the probability density function of x. It is also
called the distribution function of x or, loosely, the distribution of x.
When f(x) is drawn as a graph (Fig. 1.3), the probability that a random
observation x will fall in any interval a to b is the area under the curve
f(x) from a to b. This area is expressed as an integral

$$P(a < x < b) = \int_a^b f(x)\,dx$$


FIG. 1.3. A Probability Density Function.


The cumulative distribution function F(x) (Fig. 1.4) is a function
whose value at each point c is the probability that a random observation x
will be less than or equal to c. Thus, at each point c the height of F(c) is
equal to the area under f(x) from -∞ through c. It is the cumulative distribution
function that is usually tabulated, and it enables us to get a probability
over any interval simply by subtracting one area from another:

$$P(a < x < b) = \int_{-\infty}^{b} f(x)\,dx - \int_{-\infty}^{a} f(x)\,dx = F(b) - F(a)$$

Since a random observation x is sure to have some value, the total area
under the curve f(x) is unity:

$$\int_{-\infty}^{\infty} f(x)\,dx = 1$$

Correspondingly, the cumulative distribution function approaches 1 as x
gets large: F(+∞) = 1. Likewise F(-∞) = 0. Also, the probability density
function f(3), for example, is not itself the probability of obtaining a
random reading of 3, since the probability of obtaining a particular mathematical
number for a random observation is zero; that is, for any number a,
the area at a is

$$\int_a^a f(x)\,dx = 0$$
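
The relations among f(x), F(x), and interval probabilities can be checked numerically. The sketch below assumes, purely for illustration, a normal population with mean 1,590 and standard deviation 220; those parameter values, the interval endpoints, and the helper name `area_under_pdf` are not taken from the Manual.

```python
from statistics import NormalDist

# Hypothetical population: normal with mean 1,590 yd and standard deviation 220 yd.
dist = NormalDist(mu=1590.0, sigma=220.0)


def area_under_pdf(f, a, b, steps=20000):
    """Approximate the area under f from a to b by the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width


a, b = 1400.0, 1800.0

# P(a < x < b) as an area under the density curve ...
by_integration = area_under_pdf(dist.pdf, a, b)
# ... and as a difference of two tabulated cumulative values, F(b) - F(a).
by_cdf = dist.cdf(b) - dist.cdf(a)

# Total area (taken over mean +/- 10 standard deviations) is practically unity.
total_area = area_under_pdf(dist.pdf, dist.mean - 10 * dist.stdev,
                            dist.mean + 10 * dist.stdev)

# The area over a single point is zero, although the density there is not.
point_area = area_under_pdf(dist.pdf, 1600.0, 1600.0)

print(by_integration, by_cdf, total_area, point_area, dist.pdf(1600.0))
```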

FIG. 1.4. A Cumulative Distribution Function.


This discussion of probability distributions has considered just one of
two main types, in which the variable may take all values in an interval and
is represented by a smooth curve. In the other main type, the variable takes
on only discrete values; e.g., the non-negative integers 0, 1, 2, ... . The
two types of distributions are called continuous and discrete distributions,
respectively. An example of the discrete type is obtained by making repeated
tests of 10 fuzes in many different lots. In each test, x fuzes are found
defective, where x may take on any one of the 11 values 0, 1, 2, ..., 10,
but no others (in particular, no fractional values). If a sample of 10 fuzes
is drawn from each of 50 lots and the 50 samples are tested, a histogram
may be constructed (Fig. 1.5) just as for the rocket ranges above, each
rectangle being of unit width and centered at an integer. However, the
abutting rectangles are drawn merely for visual interest, and it should be
remembered that each frequency or relative frequency is concentrated at
a point.

If  infinitely  many  lots  could  be  tested,  we  would  obtain  the  theoretical 
model  (the  probability  distribution),  but  this  distribution  would  still  be 
represented  graphically  by  rectangles  rather  than  by  a  smooth  curve.  See 
Sec.  1.2.2  on  the  binomial  distribution  for  the  most  important  example  of  a 
discrete  probability  distribution;  this  distribution  can  be  used  as  a  model  for 
the  fuze-testing  problem  above  and  many  similar  problems. 


1.1.3.  Measures  of  Central  Location 

As  indicated  above,  the  data  can  be  reduced  considerably  by  grouping, 
and  a  good  over-all  picture  of  the  values  can  be  obtained  by  graphing.  To 
reduce  the  sample  data  further,  we  shall  want  to  know  where  the  values 
are centered. We can use one of the measures in the following paragraphs
(Sec. 1.1.3a-1.1.3d) to summarize that information. Of the four measures,
the  first  is  the  most  common. 


a. Mean. The mean or average

$$\bar{x} = \frac{1}{n}(x_1 + x_2 + \cdots + x_n) = \frac{1}{n}\sum_{i=1}^{n} x_i$$

of the n sample readings is familiar as a description of the central location
of school grades. For the data of Table 1.1

$$\bar{x} = \frac{1}{50}(1{,}472 + 1{,}315 + \cdots + 1{,}606) = 1{,}589.8 \text{ yards}$$

A mean can be defined for the whole population also, though it must
be defined slightly differently if the population is infinite. Physically, the
mean μ is the x-coordinate of the center of gravity of the area under the
probability density curve for the population. The mean x̄ of the sample
readings is generally the best estimate of the mean μ of the population from
which the readings were drawn.


FIG. 1.5. A Discrete Probability Distribution.

b.  Median.  The  median  is  the  halfway  point  in  the  readings  when 
they  have  been  arranged  in  order  of  size  (the  middle  reading  of  an  odd 
number  of  readings,  or  the  average  of  the  middle  two  for  an  even  number). 
For  the  data  of  Table  1.1,  the  median  is  1,588.5  yards,  halfway  between 
1,571  and  1,606. 

The  population  median  is  the  fiftieth  percentile  of  the  population;  that 
is,  the  number  m  such  that  the  area  under  the  distribution  curve  to  the  left 
of  m  is  0.5. 

The  population  median  and  mean  may  not  be  equal.  In  the  case  of  the 
normal  distribution  (Sec.  1.2.1),  where  the  two  are  equal,  the  sample 
median  may  be  used  to  estimate  the  population  mean;  however,  the  median 
is  not  so  good  an  estimate  as  is  the  sample  mean,  in  the  sense  that  it  does 
not  draw  the  maximum  amount  of  information  from  the  sample.  The 
median  is  less  affected  by  a  few  wild  readings  than  is  the  mean. 

c. Mode. A mode is a peak value of the frequency distribution. A single
mode  of  high  frequency  gives  a  rough  but  quick  measure  of  central  location. 
The  idea  of  mode  is  not  useful,  however,  if  the  distribution  has  more  than 
one  high  point,  as  in  Fig.  1.2,  or  does  not  show  a  single,  well-defined  peak. 
More  than  one  high  point  may  indicate  the  need  for  regrouping  the  data. 

d.  Midrange.  The  midrange  is  the  average  of  the  smallest  and  the 
largest  readings.  This  is  a  good  measure  of  the  central  location  for  samples 
of  five  or  fewer,  though  it  is  not  so  good  as  the  mean. 
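
The four measures of central location can be computed for the data of Table 1.1 as a check on the figures quoted above. This is a sketch only: the readings are transcribed from Table 1.1, the variable names are illustrative, and the mode is taken here as the modal class of the 100-yard grouping, which is one reasonable reading of the definition.

```python
import statistics

# Readings transcribed from Table 1.1 (yards).
RANGES_YD = [
    1472, 1315, 1984, 1900, 1646, 1709, 1780, 1571, 1681, 1453,
    1799, 1372, 1420, 1612, 1546, 1255, 1818, 1357, 1412, 1775,
    1850, 1417, 2057, 1468, 1694, 1776, 1456, 1489, 1618, 1544,
    1251, 1422, 1506, 1651, 1687, 1080, 1866, 1713, 1624, 1370,
    2107, 1668, 1411, 1654, 1503, 1934, 1451, 1240, 1500, 1606,
]

mean = statistics.mean(RANGES_YD)
median = statistics.median(RANGES_YD)  # average of the middle two of 50 readings
midrange = (min(RANGES_YD) + max(RANGES_YD)) / 2

# Modal class(es): lower limits of the 100-yard classes with the highest count.
class_counts = {}
for x in RANGES_YD:
    lower = 999.5 + 100 * int((x - 999.5) // 100)
    class_counts[lower] = class_counts.get(lower, 0) + 1
peak = max(class_counts.values())
modal_classes = sorted(lo for lo, c in class_counts.items() if c == peak)

print(f"mean = {mean:.1f}, median = {median:.1f}, midrange = {midrange:.1f}")
print("modal classes begin at:", modal_classes)
```

Two classes tie for the highest frequency, which illustrates why the mode is of little use for these data.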

1.1.4.  Measures  of  Dispersion 


In  reducing  sample  data,  it  is  also  necessary  to  know  how  the  values  are 
distributed  about  their  center.  There  are  several  common  measures  of 
dispersion: 

a. Standard Deviation. The standard deviation s of a sample of n
observations x₁, x₂, ..., xₙ is

$$s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2}$$

The variance s² of the sample values is the square of s; that is, essentially
the average of the squares of distances from the mean, aside from a factor
n/(n - 1).

The population parameters corresponding to s and s² are called the
population standard deviation and variance, respectively, and are denoted
by σ and σ², following the convention of using Greek letters for population
parameters.

The reason for defining the sample variance s² with n - 1 rather than n
in the denominator is that the sample variance is of most importance as an
estimate of the population variance σ² and, when defined as above, it is an
unbiased estimate of σ². "Unbiased" indicates that the values of s² from
all the hypothetically possible samples from the same population will
average to σ². The sample variance s² is the best estimate of σ² in the case
of a normal population. (As an estimate of σ, s is slightly biased. See
Ref. 1.3, p. 72.)

The computing of s² is simplified by using the algebraically equivalent
form

$$s^2 = \frac{1}{n(n-1)}\left[n\sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2\right]$$

For the data of Table 1.1

$$s^2 = \frac{50(1{,}472^2 + 1{,}315^2 + \cdots + 1{,}606^2) - (79{,}490)^2}{(50)(49)} = 48{,}683$$

$$s = 220.6$$
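
The definition of s² and the computing form can be compared directly on the data of Table 1.1. The following sketch uses readings transcribed from that table; the variable names are illustrative only.

```python
import math
import statistics

# Readings transcribed from Table 1.1 (yards).
RANGES_YD = [
    1472, 1315, 1984, 1900, 1646, 1709, 1780, 1571, 1681, 1453,
    1799, 1372, 1420, 1612, 1546, 1255, 1818, 1357, 1412, 1775,
    1850, 1417, 2057, 1468, 1694, 1776, 1456, 1489, 1618, 1544,
    1251, 1422, 1506, 1651, 1687, 1080, 1866, 1713, 1624, 1370,
    2107, 1668, 1411, 1654, 1503, 1934, 1451, 1240, 1500, 1606,
]

n = len(RANGES_YD)
total = sum(RANGES_YD)
mean = total / n

# Definition: sum of squared deviations from the mean, divided by n - 1.
s2_definition = sum((x - mean) ** 2 for x in RANGES_YD) / (n - 1)

# Computing form: [n * sum(x^2) - (sum x)^2] / [n (n - 1)].
sum_of_squares = sum(x * x for x in RANGES_YD)
s2_computing = (n * sum_of_squares - total ** 2) / (n * (n - 1))

s = math.sqrt(s2_computing)

# The standard library uses the same n - 1 divisor.
s2_library = statistics.variance(RANGES_YD)

print(f"sum = {total}, s^2 = {s2_computing:.0f}, s = {s:.1f}")
```

The computing form needs only the running totals of x and x², which is why it suited desk calculation; with floating-point arithmetic and data that have a large mean relative to their spread, the definition form is usually the safer of the two.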


b. Range. The range w is the difference between the smallest and the
largest readings in the sample. In the case of a normal distribution (Sec.
1.2.1), the range multiplied by the appropriate factor from Appendix
Table 12 gives a good measure of the population σ for a small sample
(n < 10), though not so good as s (Ref. 1.3, p. 239). For the data of
Table 1.1, the range is 2,107 - 1,080 = 1,027. Using the factor 0.222
from Table 12, we get an estimate 0.222 × 1,027 ≈ 228 for σ.
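
The range estimate is a one-line calculation once the factor is known. In the sketch below, the extreme readings and the factor 0.222 are the values quoted in the text; the function name `sigma_from_range` is hypothetical, and the factor must be looked up in Appendix Table 12 for each sample size.

```python
def sigma_from_range(smallest, largest, factor):
    """Estimate the population standard deviation as factor times the sample range."""
    w = largest - smallest
    return w, factor * w


# Values quoted in the text for the data of Table 1.1 (n = 50).
w, sigma_estimate = sigma_from_range(smallest=1080, largest=2107, factor=0.222)
print(f"range w = {w}, estimate of sigma = {sigma_estimate:.0f}")
```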

c. Mean Deviation. The mean deviation is the average of the distances
from the mean of the n sample members (all distances being taken
as positive). The mean deviation is also often defined as the average of
distances from the median rather than from the mean. For estimating σ in
a normal population, the mean deviation is not so efficient as s, but can be
multiplied by a factor (Ref. 1.3, p. 240) to give a fairly good estimate for
sample sizes n < 10.

d. Various Percentile Estimates. Estimates of the population σ based
on percentiles of the sample can be made, but they are not so good as the
sample standard deviation. For instance, (P93 - P07)(0.3388) can be used.
To find P93, arrange the values in order of size and count off 93% of them.
The value that appears nearest that point is used for P93. Other percentile
estimates using more than two percentiles (and consequently deriving more
information from the sample) can be found in Ref. 1.3, p. 231. For the
data of Table 1.1, we find (P93 - P07)(0.3388) = 230, as compared with
s = 220.6.

e. Probable Error. The probable error (PE) is a deviation from the
population mean μ such that 50% of the observations may be expected to
lie between μ - PE and μ + PE. For the normal distribution, Appendix
Table 2 shows that the probable error equals 0.674σ. The probable error
can be converted to standard deviation by multiplying by 1.4826. The more
modern standard deviation is in wider use as a measure of dispersion than
is the probable error, because the standard deviation arises naturally in the
derivations that underlie the statistical tests in common use.
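
The two conversion factors for the probable error follow from the normal distribution and can be recovered numerically. This sketch assumes a standard normal population and uses Python's `statistics.NormalDist`; the variable names are illustrative.

```python
from statistics import NormalDist

standard_normal = NormalDist()  # mean 0, standard deviation 1

# Half of the observations lie within +/- PE of the mean, so PE is the
# 75th percentile of the deviation from the mean (25% lies beyond it on each side).
pe_per_sigma = standard_normal.inv_cdf(0.75)

# Reciprocal factor for converting a probable error to a standard deviation.
sigma_per_pe = 1.0 / pe_per_sigma

# Check: probability of falling between -PE and +PE.
central_probability = (standard_normal.cdf(pe_per_sigma)
                       - standard_normal.cdf(-pe_per_sigma))

print(f"PE = {pe_per_sigma:.4f} sigma, sigma = {sigma_per_pe:.4f} PE")
```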

1.1.5.  Coding 

Computation of the measures defined in Sec. 1.1.3 and 1.1.4 can often
be simplified by coding the data; that is, by replacing the readings themselves
by coded numbers (usually shorter) at the beginning of computation
and decoding at the end. The following methods may be used: (1) subtracting
a constant, (2) multiplying by a constant, or (3) subtracting a
constant and multiplying by a constant. For instance, in computing the
mean of the readings in Table 1.1, subtracting 1,000 from each reading can
simplify the procedure. Thus, instead of

$$\bar{x} = \frac{1}{50}(1{,}472 + 1{,}315 + 1{,}984 + \cdots + 1{,}606) = 1{,}589.8$$

the equation would read:

$$\bar{x} = 1{,}000 + \frac{1}{50}(472 + 315 + 984 + \cdots + 606) = 1{,}589.8$$

Another  example  of  coding  is  given  in  Sec.  2.2.2. 
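
Coding by subtraction of a constant can be illustrated on the data of Table 1.1. In this sketch the readings are transcribed from the table and the offset 1,000 is the one used in the text; the names `OFFSET` and `sample_variance` are illustrative.

```python
# Readings transcribed from Table 1.1 (yards).
RANGES_YD = [
    1472, 1315, 1984, 1900, 1646, 1709, 1780, 1571, 1681, 1453,
    1799, 1372, 1420, 1612, 1546, 1255, 1818, 1357, 1412, 1775,
    1850, 1417, 2057, 1468, 1694, 1776, 1456, 1489, 1618, 1544,
    1251, 1422, 1506, 1651, 1687, 1080, 1866, 1713, 1624, 1370,
    2107, 1668, 1411, 1654, 1503, 1934, 1451, 1240, 1500, 1606,
]

OFFSET = 1000  # constant subtracted from every reading


def sample_variance(values):
    """Sample variance with the n - 1 divisor."""
    n = len(values)
    m = sum(values) / n
    return sum((v - m) ** 2 for v in values) / (n - 1)


# Code the data, compute with the shorter numbers, then decode.
coded = [x - OFFSET for x in RANGES_YD]
coded_mean = sum(coded) / len(coded)
decoded_mean = OFFSET + coded_mean

# Subtracting a constant shifts the mean but leaves the variance unchanged,
# so a variance computed from coded data needs no decoding.
variance_coded = sample_variance(coded)
variance_original = sample_variance(RANGES_YD)

print(f"coded sum = {sum(coded)}, decoded mean = {decoded_mean:.1f}")
```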

1.1.6.  Sampling  Distribution 

Suppose we consider a population of 300 individuals. Taking all possible
samples of 10 (there would be 300!/(10! × 290!), or more than 10^18 of
them) and recording the sample x̄ of each would give a population of x̄'s,
one for each sample of 10. That population would have a frequency distribution,
called the sampling distribution of x̄ for samples of size 10.
Similarly, we have sampling distributions for s², or for any other sample
statistic. The sampling distribution of x̄ has a mean and a standard deviation,
which can be denoted by μ_x̄ and σ_x̄. The standard deviation of a
sampling distribution is usually called the standard error of the statistic.
For sampling from any infinite population of x's, it can be shown, for
example, that the mean of the distribution of x̄ is equal to the mean μ
of the population of x's and that σ_x̄, the standard error of x̄, is equal to
σ/√n, where σ is the standard deviation of the population of x's and n is
the number of observations in each sample. Thus x̄ will tend to approximate
μ more closely, the larger the sample size.
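
Both the count of possible samples and the relation σ_x̄ = σ/√n can be checked by computation. The sketch below assumes, for illustration only, a normal population with mean 1,590 and standard deviation 220; those values, the number of simulated samples, and the seed are not from the Manual.

```python
import math
import random
import statistics

# Number of distinct samples of 10 from a population of 300.
n_possible_samples = math.comb(300, 10)

# Hypothetical infinite population: normal, mean 1,590 and standard deviation 220.
MU, SIGMA = 1590.0, 220.0
SAMPLE_SIZE = 10
N_SAMPLES = 20000

rng = random.Random(1955)
sample_means = []
for _ in range(N_SAMPLES):
    sample = [rng.gauss(MU, SIGMA) for _ in range(SAMPLE_SIZE)]
    sample_means.append(statistics.fmean(sample))

# Mean and standard deviation of the simulated sampling distribution of the mean.
mean_of_means = statistics.fmean(sample_means)
observed_standard_error = statistics.stdev(sample_means)
theoretical_standard_error = SIGMA / math.sqrt(SAMPLE_SIZE)

print(f"possible samples: {n_possible_samples:.3e}")
print(f"standard error: observed {observed_standard_error:.1f}, "
      f"theoretical {theoretical_standard_error:.1f}")
```

The simulation draws only 20,000 of the possible samples, so the observed figure agrees with σ/√n only approximately.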
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1.1.7.  Tes.  of  a  Statistical  Hypothesis 

We often draw and analyze a sample for the purpose of testing an initial
hypothesis, called the null hypothesis, about the population. For example,
we might state as a null hypothesis that the variance $\sigma^2$ equals 15. This
might represent a coded variance of 0.0015 in$^2$ on a molded part. It can be
shown that the fraction $(n - 1)s^2/\sigma^2$ has a certain sampling distribution,
which can be found in tables. For example, if $\sigma^2$ is indeed 15 and the
population has a certain type of distribution (the normal), then the statistic
$(n - 1)s^2/15$ computed from samples of size 11 has the sampling distribution
shown in Fig. 1.6. The tabled distribution so graphed shows that the
probability that the statistic $10s^2/15$ falls below 3.25 or above 20.5 (that is,
the area under those portions of the probability density curve) is only 5%
if $\sigma^2$ is actually 15.

To perform the test, we compute $10s^2/15$ from the sample. If it falls
outside the interval 3.25 to 20.5, we reject the null hypothesis that $\sigma^2$ is 15
on the ground that if $\sigma^2$ were 15, such a large or small value of $10s^2/15$
would be quite improbable.

FIG. 1.6. Frequency Distribution of $(n - 1)s^2/\sigma^2$ (for $n = 11$ and $\sigma^2 = 15$).

In this example we have rejected the hypothesis whether $10s^2/15$ was too
large or too small, by using what is called an equal-tails test, with 2.5%
at each end of the area under the distribution curve included in the rejection
region. If we were interested only in whether the variance $\sigma^2$ was not greater
than 15, we would use the one-sided test with 5% rejection region
$10s^2/15 > 18.3$.
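
The tail areas quoted in this section can be checked numerically. The following Python sketch is not part of the Manual. It assumes a normal population, so that the statistic computed from samples of size 11 follows the chi-square distribution with 10 degrees of freedom, and it uses the closed form of the chi-square upper tail that holds for an even number of degrees of freedom. The function name chi2_upper_tail is illustrative.

```python
import math


def chi2_upper_tail(x, df):
    """P(chi-square with df degrees of freedom exceeds x), for even df only."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2.0
    # For even df the upper tail is the finite sum
    # exp(-x/2) * sum_{j=0}^{df/2 - 1} (x/2)^j / j!
    terms = (half ** j / math.factorial(j) for j in range(df // 2))
    return math.exp(-half) * sum(terms)


df = 10  # n - 1 for samples of size 11

lower_tail = 1.0 - chi2_upper_tail(3.25, df)  # statistic below 3.25
upper_tail = chi2_upper_tail(20.5, df)        # statistic above 20.5
one_sided = chi2_upper_tail(18.3, df)         # statistic above 18.3

print(f"equal-tails rejection probability: {lower_tail + upper_tail:.4f}")
print(f"one-sided rejection probability:   {one_sided:.4f}")
```

Both rejection probabilities should come out close to the nominal 5%; small differences are expected because the limits 3.25, 20.5, and 18.3 are rounded table values.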

If  we  do  not  reject  the  null  hypothesis,  we  do  not  then  categorically 
conclude  that  the  hypothesis  is  true  (see  the  last  paragraph  of  Sec.  1.1.8). 
We merely recognize that the sample is compatible with the kind of population
described in the null hypothesis; that is, that the statistic computed
cannot  be  considered  extreme  if  the  null  hypothesis  holds. 

1.1.8. Level of Significance $\alpha$ and Types of Error

In the example given above, we reject the null hypothesis if the observed
$10s^2/15$ falls in a region of extreme values to which the null hypothesis
assigns probability only 0.05. Here, 5% is called the level of significance
of the test and is, in general, denoted by $\alpha$. There is a 5% risk of error, for
even if the null hypothesis does hold, there is a 5% probability that we will
reject it. This type of error, the rejection of the null hypothesis when it is
true, is called a Type I error. The risk of Type I error is $\alpha$, the level of
significance. This level is arbitrary, but should be chosen when the experiment
is designed.

Sometimes  when  it  is  important  to  guard  against  Type  I  errors,  as  in 
testing an expensive product, a level $\alpha$ of 1% or 0.1% is used. The level
of  5%  is  used  throughout  this  Manual.  (In  some  texts,  results  that  are 
significant  at  the  5%  level  are  marked  with  a  star.) 

The failure to reject a false hypothesis is called a Type II error. The risk
of a Type II error is denoted by $\beta$. In the example above, suppose that
the variance is not equal to 15, but has some alternative value, say 25. It is
possible that the statistic $10s^2/15$ will fall between 3.25 and 20.5, leading
us to accept (fail to reject) the hypothesis that $\sigma^2$ is 15, though it is
actually 25. The probability of making a Type II error depends on the
alternative value; for example, we would be less likely to accept the null
hypothesis when $\sigma^2$ equals 25 than when $\sigma^2$ equals 16.

1.1.9. Operating-Characteristic Curve

An operating-characteristic (OC) curve shows the probability of a Type
II error for different alternatives. Examples appear in Appendix Charts
V-VIII. These curves are useful in determining the sample size for an
experiment. First, we choose the level of significance $\alpha$ at which a test is to
be conducted (i.e., the risk that we can take of rejecting the null hypothesis
if it is true). Then we decide on an alternative that we wish to detect or
guard against and the risk $\beta$ that we are willing to take of making a Type II
error. (It is at this step that we key statistical significance to practical
significance in the problem at hand.) The OC curves will show what sample
size will satisfy the two conditions. The confidence interval, explained
below, is also useful in determining sample size. Details of these two
methods of determining sample size are given, with particular problems, in
following chapters of this Manual.
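
As a rough illustration of what an OC curve tabulates, the Python sketch below computes the Type II risk of the equal-tails variance test of Sec. 1.1.7 (samples of size 11, acceptance region 3.25 to 20.5) for several alternative variances. It is not taken from the Manual: it assumes a normal population, relies on the chi-square upper-tail formula for an even number of degrees of freedom, and the names chi2_upper_tail and type_ii_risk are illustrative.

```python
import math


def chi2_upper_tail(x, df):
    """P(chi-square with df degrees of freedom exceeds x), for even df only."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form needs a positive even df")
    half = x / 2.0
    terms = (half ** j / math.factorial(j) for j in range(df // 2))
    return math.exp(-half) * sum(terms)


def type_ii_risk(alt_var, null_var=15.0, df=10, lo=3.25, hi=20.5):
    """Probability that df*s^2/null_var falls in (lo, hi) when the true variance is alt_var."""
    # df*s^2/alt_var is chi-square(df), so the acceptance limits are rescaled.
    scale = null_var / alt_var
    return chi2_upper_tail(lo * scale, df) - chi2_upper_tail(hi * scale, df)


# One point of the OC curve per alternative variance.
for alt_var in (15.0, 16.0, 20.0, 25.0, 40.0):
    print(f"true variance {alt_var:5.1f}: probability of acceptance = {type_ii_risk(alt_var):.3f}")
```

At the null value 15 the acceptance probability is about 0.95, and it falls as the alternative moves away from 15; under these assumptions it is still roughly 0.73 at a true variance of 25, which suggests how little a sample of 11 can discriminate between the two values.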

Most  of  us  have  some  intuition  about  permissible  risks  in  ordinary 
problems,  but  some  experimenters  may  object  that  it  is  about  as  difficult  to 
specify the risks $\alpha$ and $\beta$ as to specify a sample size, or more difficult if a
budget  has  already  been  fixed.  Even  if  the  sample  size  has  already  been 
specified,  it  is  wise  to  determine  the  consequent  risks  from  the  appropriate 
OC  curve. 

An alternative, but more difficult, method of determining sample size is
to choose it so that the total cost is at a minimum. If we denote the null
hypothesis under test by $H_0$ and the alternative we wish to detect by $H_1$, and
we make final acceptance of one or the other on the basis of the experiment,
then the total cost is given by the equation

Total cost = (risk of accepting $H_0$ if $H_1$ is true)
             × (cost of accepting $H_0$ if $H_1$ is true)
           + (risk of accepting $H_1$ if $H_0$ is true)
             × (cost of accepting $H_1$ if $H_0$ is true)
           + (cost of experiment)

As the sample size increases, the first two terms decrease (because the risks
decrease), and the last term increases. Thus, the curve of total cost against
sample size may be expected to decrease at first but to increase ultimately.
The sample size giving the minimum point is the appropriate choice. The
risks can be evaluated by the theory of probability, but the costs must be
specified by the user of the experimental results. This specification may
be difficult; consider, for example, the danger of accepting the poorer of
two weapons for general use for an unknown number of years to come.
However, the approach may be of value in many problems.

1.1.10.  Confidence  Intervals 

In estimating a population measure, say $\mu$, by a sample measure, say $\bar{x}$,
we need not be content with recording that $\mu$ has a value somewhere
near $\bar{x}$. Actually, we can make use of the sampling distribution of $\bar{x}$ to
construct an interval about $\bar{x}$ and, with a specified confidence, state that $\mu$
lies in the interval. For a 95% confidence interval, the statement will be
correct 95% of the time, in the sense that if we drew many samples and
computed $\bar{x}$ and the corresponding confidence interval for each of them, in
about 95 cases out of 100 the confidence intervals would contain $\mu$.

The end points of a confidence interval are called confidence limits,
and the relative frequency with which the interval will contain $\mu$ (in the
sense cited above) is called the confidence coefficient. (The confidence
coefficient is frequently chosen as 0.95 or 0.99.) By specifying the length
of the confidence interval, we can often determine sample size.

If the standard error of a statistic is known or can be estimated, an
approximate 95% confidence interval can be constructed easily by using as
confidence limits the statistic plus or minus twice its standard error. For
example, such confidence limits for the mean are $\bar{x} \pm 2\sigma_{\bar{x}} = \bar{x} \pm 2\sigma/\sqrt{n}$,
where $\sigma$ is the standard deviation of an individual observation. If $\sigma$ is
known, we can use the length of the confidence interval, $4\sigma/\sqrt{n}$, to determine
the sample size $n$. Thus, if $\sigma$ is 10 and we want to hold $\mu$ to an
interval of length 5, we have $4(10)/\sqrt{n} = 5$, or $n = 64$.
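
The sample-size calculation above can be written as a short Python sketch. It is an illustration only: the function name and the multiplier argument (2 for the approximate 95% interval used in the text) are assumptions, and the result is rounded up to a whole number of observations.

```python
import math


def sample_size_for_interval_length(sigma, length, multiplier=2.0):
    """Smallest n for which xbar +/- multiplier*sigma/sqrt(n) is no longer than `length`."""
    # The interval length is 2 * multiplier * sigma / sqrt(n); solve for n and round up.
    exact_n = (2.0 * multiplier * sigma / length) ** 2
    return math.ceil(exact_n)


# The case worked in the text: sigma = 10, desired interval length 5.
print(sample_size_for_interval_length(sigma=10, length=5))
```

Because the required $n$ varies as the inverse square of the interval length, halving the length multiplies the sample size by four.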

1.1.11.  Tolerance  Intervals 

Confidence limits enclose some population parameter ($\mu$, for instance)
with a given confidence. Tolerance intervals are constructed from experimental
data so as to enclose $P$% or more of the population with the given
confidence $1 - \alpha$. That is, if tolerance intervals are constructed for many
samples according to the specified rules, then in $100(1 - \alpha)$% of the cases
they will enclose at least $P$% of the population. (See Sec. 4.9.) The percentage
$P$ may be called the population coverage.

The tolerance intervals discussed here are computed from the sample to
show where most of the population can be expected to lie. They are, then,
different from the tolerances given in specifications which show where all
acceptable items must lie.

1.1.12.  Degrees  of  Freedom 

In  the  language  of  physics,  a  point  that  can  move  freely  in  three- 
dimensional  space  has  three  degrees  of  freedom.  Three  variable  coordinates, 
x,  y,  and  z,  can  take  on  different  values  independently.  If  we  constrain  the 
point  to  move  in  a  plane,  it  has  just  two  degrees  of  freedom,  this  fact  being 
shown by a dependency relation among $x$, $y$, and $z$: $ax + by + cz = d$.

A similar concept is used in statistical language. The sum of $n$ squares
of deviations from the sample mean

$$\sum_{i=1}^{n} (x_i - \bar{x})^2 = (x_1 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2$$

where

$$\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i$$

is said to have just $n - 1$ degrees of freedom, for if $\bar{x}$ is fixed, only $n - 1$
of the $x$'s can be chosen independently and the $n$th is then determined.
The sampling distributions of some statistics depend on the number of
degrees of freedom. The most common examples of such statistics are $t$,
or Student's $t$ (Chapter 2), $\chi^2$ (Chapters 3 and 4), and $F$ (Chapters
3 and 5). Their distributions are tabulated in the Appendix.

1.2. Particular Distributions

1.2.1.  Normal  Distribution 

The normal distribution, which has the probability density function

$$f(x) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-(x-\mu)^2/2\sigma^2}$$

with mean $\mu$ and standard deviation $\sigma$, is of particular interest in both
applied and theoretical statistics. It occurs frequently in practical problems
and is easy to use because its properties have been thoroughly investigated.

The normal distribution is illustrated in Fig. 1.7 in terms of the standardized
variable $z = (x - \mu)/\sigma$. The transformation or change of variable
$z = (x - \mu)/\sigma$ is in common use, for it permits us to use one table for all
normal distributions, irrespective of their different means and variances.
The distribution of $z$ is normal with zero mean and unit standard deviation.
The normal curve is symmetric, with its one mode (or peak) and its median
at the mean $\mu$.

If each of several independent variables $x_i$ has a normal distribution with
mean $\mu_i$ and variance $\sigma_i^2$, then the sum $y = \sum c_i x_i$, where the $c_i$ are constants,
has a normal distribution with mean $\sum c_i \mu_i$ and variance $\sum c_i^2 \sigma_i^2$.

The cumulative distribution function of the normal distribution is

$$F(x) = \frac{1}{\sqrt{2\pi}\,\sigma} \int_{-\infty}^{x} e^{-(t-\mu)^2/2\sigma^2}\, dt$$

A typical curve of this type is given in Fig. 1.8. Note that the height of the
curve $F(x)$ at a point $c$ equals the area under the curve $f(x)$ to the left of $c$.

FIG. 1.7. Probability Density Function of the Normal Distribution.

FIG. 1.8. Cumulative Distribution Function of the Normal Distribution.


Values  of  the  cumulative  distribution  function  corresponding  to  values 
of  z  are  given  in  Appendix  Table  1  along  with  a  sketch  showing  the  area 
under  the  normal  curve  as  it  appears  in  a  particular  case.  Values  of  z 
corresponding to an area $F$ are given in Appendix Table 2. The following
examples illustrate the use of these tables.

Example. If $x$ has a normal distribution, mean 10, and standard
deviation 20, what is the probability of a random observation falling
between 15 and 30? Between $-5$ and 20?

For the first question, we find

$$z_1 = \frac{15 - 10}{20} = 0.25 \qquad z_2 = \frac{30 - 10}{20} = 1.0$$

From Appendix Table 1, the probability of a $z$ less than $z_1$ is 0.5987,
and the probability of a $z$ less than $z_2$ is 0.8413. Then the probability of
a $z$ between $z_1$ and $z_2$ is $0.8413 - 0.5987 = 0.2426$. (See Fig. 1.9.)


FIG. 1.9.

For the second question, we have

$$z_1 = \frac{-5 - 10}{20} = -0.75 \qquad z_2 = \frac{20 - 10}{20} = 0.50$$

From symmetry, the probability of a $z$ less than 0.75 is the same as the
probability of a $z$ greater than $-0.75$. From Appendix Table 1, the probability
of a $z$ less than 0.75 is 0.7734; hence the probability of a
$z$ less than $-0.75$ is $1 - 0.7734 = 0.2266$. The probability of a $z$ less
than $z_2$ is 0.6915. The probability of a $z$ between $z_1$ and $z_2$ is then
$0.6915 - 0.2266 = 0.4649$. (See Fig. 1.10.)

FIG. 1.10.

Example. What is the probability that a random normal number will
lie within $0.7\sigma$ units of the mean?

We compute

$$z = \frac{x - \mu}{\sigma} = \frac{(\mu + 0.7\sigma) - \mu}{\sigma} = 0.7$$

From Appendix Table 1, the probability of a $z$ less than 0.7 is 0.7580.
The probability of a $z$ less than zero is 0.5000. Subtracting, the probability
of a $z$ between zero and 0.7 is $0.7580 - 0.5000 = 0.2580$. From symmetry,
the probability of a reading within $0.7\sigma$ to the left of the mean is
also 0.2580. The total probability, then, is $2(0.2580) = 0.5160$. (See
Fig. 1.11.)
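
The table look-ups in the two examples above can be reproduced with the NormalDist class of the Python standard library, which plays the role of Appendix Table 1 here. This is only a cross-check sketch, and the variable names are illustrative.

```python
from statistics import NormalDist

# First example: x is normal with mean 10 and standard deviation 20.
x_dist = NormalDist(mu=10, sigma=20)
p_15_to_30 = x_dist.cdf(30) - x_dist.cdf(15)
p_minus5_to_20 = x_dist.cdf(20) - x_dist.cdf(-5)

# Second example: probability of lying within 0.7 sigma of the mean,
# worked in terms of the standardized variable z.
z_dist = NormalDist()  # zero mean, unit standard deviation
p_within_07_sigma = z_dist.cdf(0.7) - z_dist.cdf(-0.7)

print(f"P(15 < x < 30)         = {p_15_to_30:.4f}")
print(f"P(-5 < x < 20)         = {p_minus5_to_20:.4f}")
print(f"P(within 0.7 sigma)    = {p_within_07_sigma:.4f}")
```

The computed values may differ from the text in the fourth decimal place, because the text subtracts table entries that have already been rounded.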

Example. If $x$ has a normal distribution with standard deviation 10,
find a balanced interval around the mean so that the probability is 0.95
that a random reading will fall in the interval.

Since an area of $(1 - 0.95)/2 = 0.025$ must lie to the left of the
interval, we read from Appendix Table 2 that $z = -1.960$. From

$$z = \frac{x - \mu}{\sigma}$$

we have

$$x = \mu + z\sigma = \mu + (-1.960)(10) = \mu - 19.60$$

The desired interval is from $\mu - 19.60$ to $\mu + 19.60$. (See Fig. 1.12.)

From Appendix Table 2, we see that 50% of a normal population
falls within $0.674\sigma$ of the mean. From Appendix Table 1, 68.3% falls
within $1\sigma$ of the mean; 95.4% within $2\sigma$; and 99.7% within $3\sigma$.
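
The inverse look-up (from an area to a value of $z$, as in Appendix Table 2) can be sketched in Python with NormalDist.inv_cdf from the standard library. The helper coverage_within is an illustrative name, not something defined in the Manual.

```python
from statistics import NormalDist

z_dist = NormalDist()  # standardized normal variable

# Area 0.025 to the left of the interval gives the lower z value.
z_lower = z_dist.inv_cdf(0.025)
sigma = 10
half_width = -z_lower * sigma  # the interval is mu - half_width to mu + half_width


def coverage_within(k):
    """Fraction of a normal population within k standard deviations of the mean."""
    return z_dist.cdf(k) - z_dist.cdf(-k)


# Half of the population lies within this many standard deviations of the mean.
z_half = z_dist.inv_cdf(0.75)

print(f"z for lower 2.5% point: {z_lower:.3f}; half-width: {half_width:.2f}")
print(f"50% of the population lies within {z_half:.3f} sigma")
for k in (1, 2, 3):
    print(f"within {k} sigma: {coverage_within(k):.3f}")
```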

Some  of  the  uses  of  the  normal  distribution  function  are  listed  below.  It 
can  be  used: 

1. As a curve to "fit" a set of data which appear to be approximately
normally distributed when the sample is fairly large (> 50). (See
Chapter 4 for tests of normality.) The cumulative normal distribution
is "fitted" by substituting the sample mean and standard deviation for
the corresponding population values in the equation for $F(x)$.

2.  As  an  approximating  distribution  for  certain  sample  statistics,  such  as 
the  mean,  when  the  sample  size  is  fairly  large  (>  30  for  the  mean). 

3.  As  an  approximation  to  other  distributions  such  as  the  binomial  (to 
be  discussed  subsequently)  when  the  samples  are  fairly  large  and 
certain  other  criteria  are  satisfied.  (See  Ref.  1.11,  pp.  152-59.) 

4.  As  the  basic  distribution  in  most  practical  applications,  for  tests  of 
significance,  quality-control  work,  design  of  experiments,  and  variables 
sampling;  that  is,  the  variable  x  under  consideration  is  assumed  to  be 
normally  distributed.  (See  Chapter  4  for  tests  of  normality.) 

One of the applications of considerable interest on this Station is determining
the probability of a hit on a target when the shots have a normal
distribution.  The  technique  to  be  applied  depends  on  the  shape  and  position 
of  the  target.  The  following  examples  are  typical. 

Example (One Dimension). A plane flies low above a highway and
drops bombs. Call the perpendicular distance from the highway center
to each bomb-impact point its $x$-distance (positive to one side of the
road, negative to the other), and assume that the variable $x$ has a normal
distribution with mean zero and standard deviation 20 yards. What
proportion of the bombs dropped can be expected to fall within 6 yards
of the center of the highway? (See Fig. 1.13.)

We have $z_1 = (-6 - 0)/20 = -0.3$, and $z_2 = (6 - 0)/20 = 0.3$.
From Appendix Table 1, $F(0.3) = 0.6179$. Then the expected proportion
of successful shots (within 6 yards of the highway center) is
$2(0.6179 - 0.5) = 0.2358$.

FIG. 1.13.

Example (Two Dimensions, Rectangular Target). Suppose shots
aimed at a rectangular target are subject to random influences such that
the horizontal deviations $x$ and the vertical deviations $y$ of the shots are
independent and normally distributed. (See Fig. 1.14.) What proportion
of shots can be expected to hit the target?

Since the horizontal coordinate and the vertical coordinate are independent,
the probability that a particular shot will fall in the target area
is the probability that its horizontal coordinate will fall in the interval
$(a, b)$ cut off by the target, denoted by $P(a < x < b)$, multiplied by the
probability that its vertical coordinate will fall in the interval $(c, d)$ cut
off by the target, denoted by $P(c < y < d)$. Thus the expected proportion
is $P(a < x < b) \cdot P(c < y < d)$. Each of these probabilities can be
found by use of Appendix Table 1. For instance, if the mean values of
$x$ and $y$ are both zero and $a = -0.6745\sigma_x$, $b = 0.6745\sigma_x$, $c = -0.6745\sigma_y$,
and $d = 0.6745\sigma_y$, then $P(a < x < b) = 0.5000$ and $P(c < y < d) = 0.5000$.
Hence, the probability of a shot lying in the corresponding
rectangle is 0.2500.
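
The one-dimensional and rectangular-target calculations can be sketched in Python as follows. The function names are illustrative, and the standard deviations 30 and 12 used for the rectangle are arbitrary values chosen only to show that the answer does not depend on them when the target edges are expressed in multiples of $\sigma_x$ and $\sigma_y$.

```python
from statistics import NormalDist


def interval_probability(low, high, mean, sigma):
    """P(low < x < high) for a normal variable with the given mean and sigma."""
    dist = NormalDist(mean, sigma)
    return dist.cdf(high) - dist.cdf(low)


def rectangle_hit_probability(a, b, c, d, sigma_x, sigma_y, mean_x=0.0, mean_y=0.0):
    """P(a < x < b) * P(c < y < d) for independent normal deviations x and y."""
    return (interval_probability(a, b, mean_x, sigma_x)
            * interval_probability(c, d, mean_y, sigma_y))


# Highway example: within 6 yards of the center, standard deviation 20 yards.
p_highway = interval_probability(-6, 6, mean=0, sigma=20)

# Rectangular example with edges at 0.6745 standard deviations (assumed sigmas).
sigma_x, sigma_y = 30.0, 12.0
p_rectangle = rectangle_hit_probability(
    -0.6745 * sigma_x, 0.6745 * sigma_x,
    -0.6745 * sigma_y, 0.6745 * sigma_y,
    sigma_x, sigma_y,
)

print(f"highway:   {p_highway:.4f}")
print(f"rectangle: {p_rectangle:.4f}")
```

The product rule in rectangle_hit_probability is valid only under the independence assumption stated in the example.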

FIG. 1.14.

Example (Two Dimensions, Circular Target). Suppose shots aimed
at a circular target are subject to random influences such that the horizontal
positions and the vertical positions are independent and normally
distributed, with mean at the target center and common standard deviation
$\sigma$. What proportion of shots can be expected to fall within $r$ units
of the target center? (See Fig. 1.15.)

FIG. 1.15.

By calculus (integration in polar coordinates) the probability can be
determined as

$$1 - e^{-r^2/2\sigma^2}$$

where $e$ is the base of natural logarithms. Thus, if $r$ is twice the standard
deviation, we have

$$1 - e^{-(2\sigma)^2/2\sigma^2} = 1 - e^{-2} = 1 - 0.135 = 0.865$$

as the proportion expected to hit the target in this case. The radius of
the circle centered at the mean which contains 50% of the shots (called
the circular probable error) is $\sigma\sqrt{2\log_e 2} = 1.1774\sigma$. The root-mean-square
radial error (that is, the square root of the average of the squared
radial distances from the mean point) is $\sigma\sqrt{2} = 1.4142\sigma$.
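
The circular-target formulas translate directly into a few lines of Python. The sketch below is illustrative; the function names are assumptions, and the circular probable error is obtained by setting the hit probability equal to 0.5 and solving for the radius.

```python
import math


def circle_hit_probability(r, sigma):
    """Probability of landing within r of the center, common standard deviation sigma."""
    return 1.0 - math.exp(-r ** 2 / (2.0 * sigma ** 2))


def circular_probable_error(sigma):
    """Radius containing 50% of the shots: solve 1 - exp(-r^2 / (2 sigma^2)) = 0.5."""
    return sigma * math.sqrt(2.0 * math.log(2.0))


def rms_radial_error(sigma):
    """Square root of the mean squared radial distance, sqrt(sigma^2 + sigma^2)."""
    return sigma * math.sqrt(2.0)


print(f"r = 2 sigma:             {circle_hit_probability(2.0, 1.0):.3f}")
print(f"circular probable error: {circular_probable_error(1.0):.4f} sigma")
print(f"rms radial error:        {rms_radial_error(1.0):.4f} sigma")
```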

Example  (Two  Dimensions,  Any  Target).  For  other  target  shapes 
and  positions  a  chart  technique  is  available.  Again  we  assume  that  the 
horizontal  and  vertical  deviations  have  independent  normal  distributions. 
The target is drawn to scale on tracing paper and held over Appendix
Chart I.¹ (In practice, it is preferable to use a suitable copy of Appendix
Chart I mounted on cardboard.) A count of the number of rectangles
covered by the target gives the probability of a hit, each rectangle representing
a probability of 0.001. The scales for drawing the target, possibly
different  for  each  of  the  two  perpendicular  directions,  are  fixed  by  the 
condition  that  the  standard  deviation  of  the  shots  in  each  direction  must 
correspond  to  the  length  marked  on  the  chart.  Since  the  chart  shows  only 
one  quadrant,  it  must  be  rotated  through  four  settings  to  give  a  complete 
count. 

Suppose  that  points  of  impact  of  a  type  of  rocket  have  a  two- 
dimensional  normal  distribution  about  the  target  point  with  standard 
deviation  of  200  yards  in  range  and  60  yards  in  lateral  deflection.  What 
is  the  probability  that  a  rocket  will  land  within  a  circle  of  100-yard 
radius  about  the  target  point? 

In  order  to  use  Appendix  Chart  I  to  obtain  the  probability,  we  must 
first  draw  the  target  to  scale,  so  that  the  lengths  marked  "1  standard 
deviation”  on  the  chart  will  correspond  in  the  range  direction  to  200  yards 
and  in  the  lateral  direction  to  60  yards.  The  circle  becomes  an  ellipse, 
one  quadrant  of  which  (superimposed  on  Chart  I)  is  shown  in  Fig.  1.16. 
A  count  of  the  chart  rectangles  in  this  quadrant  covered  by  the  ellipse 
gives 73.3. Multiplying by 4 to allow for the other (symmetric) quadrants,
we have 293.2. Since each rectangle corresponds to a probability
of  0.001,  the  probability  of  a  hit  within  100  yards  of  the  target  is 
about  0.293. 
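
Where the chart is not at hand, the same probability can be approximated by numerical integration. The Python sketch below is an alternative to the chart technique, not the Manual's method: it slices the circle into thin strips in the range direction and, for each strip, multiplies the probability density of the range error by the probability that the lateral error falls inside the chord. The function name and the number of strips are assumptions.

```python
import math
from statistics import NormalDist


def circle_hit_probability_unequal(radius, sigma_x, sigma_y, steps=20000):
    """P(x^2 + y^2 < radius^2) for independent zero-mean normal x and y (midpoint rule)."""
    std = NormalDist()  # standardized normal
    strip_width = 2.0 * radius / steps
    total = 0.0
    for i in range(steps):
        x = -radius + (i + 0.5) * strip_width        # midpoint of the i-th strip
        half_chord = math.sqrt(radius ** 2 - x ** 2)  # half-length of the chord at x
        p_y = 2.0 * std.cdf(half_chord / sigma_y) - 1.0  # P(|y| < half_chord)
        density_x = std.pdf(x / sigma_x) / sigma_x       # density of x at the midpoint
        total += density_x * p_y * strip_width
    return total


# Rocket example: 200 yards in range, 60 yards in deflection, 100-yard circle.
p_rocket = circle_hit_probability_unequal(100.0, 200.0, 60.0)
print(f"probability of landing within 100 yards: {p_rocket:.3f}")
```

Under these assumptions the integration gives a value near 0.30, in reasonable agreement with the chart count of about 0.293; a difference of this size is to be expected from counting partially covered rectangles by eye.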

¹ The rectangular chart was introduced in the Bureau of Ordnance during World
War II by R. S. Burington, E. L. Crow, and A. D. Sprague.

1.2.2. Binomial Distribution

Suppose each sample member is to be judged on some yes-or-no basis,
such as a hit or a miss, a good item or a defective item, a head or a tail.
Suppose, further, that a random item has probability $p$ of being a hit and
probability $q$ of being a miss. Since it must be one or the other, we have
$p + q = 1$. The binomial distribution is defined by

$$P_n(x) = C_x^n\, p^x q^{n-x} \qquad (x = 0, 1, 2, \ldots, n)$$

where

$P_n(x)$ = probability of getting exactly $x$ hits in $n$ trials
$C_x^n$ = number of combinations of $n$ things taken $x$ at a time, also
denoted by $C(n, x)$, ${}_nC_x$, and $\binom{n}{x}$

$$C_x^n = \frac{n!}{x!\,(n - x)!} = \frac{n(n - 1)(n - 2) \cdots (n - x + 1)}{(1)(2)(3) \cdots (x)}$$

with $0!$ defined as 1.

The probability of getting at least $r$ hits in $n$ trials is

$$\sum_{x=r}^{n} P_n(x)$$

The mean and variance of $x$, the number of successes in $n$ trials, are given
by $\mu = np$ and $\sigma^2 = npq$.

Some  of  the  uses  of  the  binomial  distribution  function  follow.  It  can 
be  used: 

1.  As  a  discrete  distribution  to  fit  a  set  of  data  which  are  classified  in 
two  groups.  (When  just  one  sample  of  size  n  is  available,  the 
binomial  distribution  indicates  how  well  the  unknown  parameter  p 
is  estimated  by  the  sample  proportion  x/n.  See  Sec.  2.3.) 

2.  As  the  basic  distribution  in  attributes  sampling.  (See  Chapter  8.) 

Tables  of  the  binomial  distribution  are  available  (Ref.  1.9  and  2.1). 

Example. Suppose that all the shots of a certain series are sure to hit
a circular target (Fig. 1.17), and that the probability of a single shot
landing in any particular area of the target is proportional to the area.
Note that this differs from the normality assumption of Sec. 1.2.1. Then
the single-shot probability $p$ of landing in the shaded bull's-eye is the
ratio of this area to the total, that is, $\pi(1.5)^2/\pi(3)^2 = 1/4$. The single-shot
probability $q$ of missing the bull's-eye is 3/4. In Table 1.3, the number $n$ of
shots is taken as 10. For each entry in the first column, the second
column gives the binomial probability of getting exactly that number
of bull's-eyes out of the 10 shots. The last column gives the cumulative
binomial probability of getting that number of bull's-eyes or fewer. The
results are graphed in Fig. 1.18.

FIG. 1.17.

TABLE 1.3. Binomial Probabilities for $n = 10$ and $p = 1/4$

No. of            Binomial probability                   Binomial
bull's-eyes, x    $C_x^{10} (1/4)^x (3/4)^{10-x}$        cumulative function

 0                0.0563                                 0.0563
 1                0.1877                                 0.2440
 2                0.2816                                 0.5256
 3                0.2503                                 0.7759
 4                0.1460                                 0.9219
 5                0.0584                                 0.9803
 6                0.0162                                 0.9965
 7                0.0031                                 0.9996
 8                0.0004                                 1.0000
 9                0.0000                                 1.0000
10                0.0000                                 1.0000
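
The entries of Table 1.3 can be regenerated from the binomial formula. The Python sketch below uses math.comb from the standard library for the number of combinations; the function and variable names are illustrative.

```python
import math


def binomial_pmf(x, n, p):
    """Probability of exactly x hits in n trials with single-trial probability p."""
    return math.comb(n, x) * p ** x * (1.0 - p) ** (n - x)


n, p = 10, 0.25  # ten shots, bull's-eye probability 1/4

pmf = [binomial_pmf(x, n, p) for x in range(n + 1)]

# Running sum gives the cumulative function (x bull's-eyes or fewer).
cdf = []
running = 0.0
for value in pmf:
    running += value
    cdf.append(running)

for x in range(n + 1):
    print(f"{x:2d}  {pmf[x]:.4f}  {cdf[x]:.4f}")

# Mean and variance computed from the distribution itself.
mean = sum(x * pmf[x] for x in range(n + 1))
variance = sum((x - mean) ** 2 * pmf[x] for x in range(n + 1))
```

The mean and variance computed at the end agree with $np = 2.5$ and $npq = 1.875$.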

FIG. 1.18. Binomial Distribution for $n = 10$ and $p = 1/4$.

1.2.3. Poisson Distribution

As an approximation to the binomial distribution when $n$ is large and $p$
is small, there is the Poisson distribution

$$p(x) = \frac{m^x e^{-m}}{x!} \qquad (x = 0, 1, 2, \ldots)$$

where $m = np$ of the approximated binomial distribution and $e = 2.71828$,
the base of natural logarithms. Note that $0! = 1$ and $m^0 = 1$.

The mean and variance of the Poisson distribution are equal: $\mu = m$,
and $\sigma^2 = m$. Tables of the Poisson distribution are available (Ref. 1.8).
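
To give a feeling for how close the approximation can be, the Python sketch below compares the two distributions for one hypothetical case, $n = 100$ and $p = 0.02$ (so that $m = 2$). These values are assumptions chosen for illustration and do not come from the Manual.

```python
import math


def binomial_pmf(x, n, p):
    """Binomial probability of exactly x hits in n trials."""
    return math.comb(n, x) * p ** x * (1.0 - p) ** (n - x)


def poisson_pmf(x, m):
    """Poisson probability of exactly x occurrences when the mean is m."""
    return m ** x * math.exp(-m) / math.factorial(x)


n, p = 100, 0.02  # hypothetical case: n large, p small
m = n * p

for x in range(7):
    print(f"{x}  binomial {binomial_pmf(x, n, p):.4f}  Poisson {poisson_pmf(x, m):.4f}")

# Largest absolute difference between the two probabilities over all x.
largest_gap = max(abs(binomial_pmf(x, n, p) - poisson_pmf(x, m)) for x in range(n + 1))
print(f"largest difference: {largest_gap:.4f}")
```

For this case the two sets of probabilities differ by less than 0.005 at every $x$; the agreement would be expected to worsen as $p$ grows with $n$ held fixed.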

The Poisson distribution is applicable in some cases in its own right,
as well as serving as an approximation to the binomial distribution. For
example, suppose that in a manufacturing process at a pilot plant each
defective item occurs at a random time, not influenced by the time any
other defective is produced. Suppose, further, that the probability that $d$
defectives will be produced in a given interval of time is not affected by
the number of defectives in any other interval of time. Then, if the average
number of defectives per day is $m$, the Poisson distribution function

$$p(x) = \frac{m^x e^{-m}}{x!} \qquad (x = 0, 1, 2, \ldots)$$

gives the probability of getting $x$ defectives in one day. If production
figures are not in agreement with the predicted frequencies, we must decide
that defects are not occurring randomly and may be affected by some fault
in production. (For "goodness of fit," see Chapter 4.) In Table 1.4 (taken
from Ref. 1.1), the agreement is close, giving no reason to suspect a fault
in production. (See Sec. 4.2 for an exact test.) The method used can be
applied to any problem satisfying similar conditions. In any investigation
of defects, such as the distribution of fragments per square yard
on a bombarded target, the distribution of flaws per yard of woven sleeving,
or the distribution of number of defective igniter squibs per lot, we are
interested in whether the occurrence of a flaw is a random result of satisfactory
production, or whether it can be ascribed to faulty production.

TABLE 1.4. Fitting a Poisson Distribution

No. of defects    No. of occurrences    Relative freq. of    Freq. of occurrences
produced in       of x defects          occurrences of x     predicted by
1 day, x          in 1 day              defects in 1 day     Poisson distr.

0                 102                   0.5075               0.4741
1                  59                   0.2935               0.3538
2                  31                   0.1542               0.1320
3                   8                   0.0398               0.0328
4                   0                   0.0000               0.0061
5                   1                   0.0050               0.0009
6                   0                   0.0000               0.0001
7 or more           0                   0.0000               0.0000

Total             201                   1.0000               0.9998

Note: Total number of defects = 150; total number of days on which observations
were made = 201; estimated mean number of defects = m = 150/201 = 0.74627.
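
The fitting procedure behind Table 1.4 can be sketched in Python: estimate $m$ as the total number of defects divided by the number of days, then evaluate the Poisson formula at each $x$. The observed counts are those of Table 1.4; the variable and function names are illustrative.

```python
import math

# Observed counts from Table 1.4: number of days on which x defects were produced.
days_with_x_defects = {0: 102, 1: 59, 2: 31, 3: 8, 4: 0, 5: 1, 6: 0}

total_days = sum(days_with_x_defects.values())
total_defects = sum(x * days for x, days in days_with_x_defects.items())
m = total_defects / total_days  # estimated mean number of defects per day


def poisson_pmf(x, m):
    """Poisson probability of exactly x defects in one day when the mean is m."""
    return m ** x * math.exp(-m) / math.factorial(x)


print(f"estimated m = {m:.5f}")
for x, days in days_with_x_defects.items():
    observed = days / total_days
    print(f"{x}  observed {observed:.4f}  predicted {poisson_pmf(x, m):.4f}")
```

Comparing the two printed columns by eye is only an informal check; a formal comparison is the goodness-of-fit test referred to in Chapter 4.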

BIBLIOGRAPHY 

1.1.  Brownlee,  K.  A.  Industrial  Experimentation,  3rd  American  ed.
Brooklyn,  N.  Y.,  Chemical  Publishing  Co.,  Inc.,  1949.  Chap.  VI,
"Poisson  Distribution."

1.2.  Burington,  R.  S.,  and  D.  C.  May,  Jr.  Handbook  of  Probability  and 
Statistics  With  Tables.  Sandusky,  Ohio,  Handbook  Publishers,  Inc., 
1953. 

1.3.  Dixon,  W.  J.,  and  F.  J.  Massey,  Jr.  An  Introduction  to  Statistical 
Analysis.  New  York,  McGraw-Hill,  1951.  Chap.  1-7  and  14-16. 

1.4. Frankford Arsenal. Statistical Manual. Methods of Making Experimental
Inferences, 2nd rev. ed., by C. W. Churchman. Philadelphia,
Frankford Arsenal, June 1951.

1.5.  Hald,  A.  Statistical  Theory  With  Engineering  Applications.  New 
York,  Wiley,  1952. 

1.6. Kendall, M. G. The Advanced Theory of Statistics, Vol. I, 5th ed.
London, Griffin, 1952. Chap. 1, 2, and 8.

1.7. Kendall, M. G., and B. Babington Smith. Tables of Random Sampling
Numbers. Cambridge, England, Cambridge University Press,
1939. (Tracts for Computers No. 24.)

1.8.  Molina,  E.  C.  Poisson's  Exponential  Binomial  Limit.  New  York, 
Van  Nostrand,  1942. 

1.9.  National  Bureau  of  Standards.  Tables  of  the  Binomial  Probability 
Distribution.  Washington,  GPO,  1949.  (Applied  Mathematics 
Series  6.) 

1.10.  U.  S.  Naval  Ordnance  Test  Station,  Inyokern.  First  Aid  for  Pet 
Projects  Injured  in  the  Lab  or  on  the  Range  or  What  To  Do  Until 
the  Statistician  Comes,  by  R.  M.  McClung.  China  Lake,  Calif., 
NOTS,  29  January  1952.  (Technical  Memorandum  1113.) 

1.11. Wilks, S. S. Elementary Statistical Analysis. Princeton, N. J., Princeton
University Press, 1949.


Chapter  2 

TESTS  AND  CONFIDENCE  INTERVALS  FOR  MEANS 

2.0.  Introduction 

This  chapter  gives  tests  and  confidence  intervals  for  population  means 
based on random samples from those populations. As used here, "test"
means a test of a hypothesis, set up prior to the experiment, that the
population mean has a specific numerical value. The test yields simply
the conclusion that the data are or are not consistent with the hypothesis.
The method of testing depends on whether the population standard deviation
is known or unknown. In experimentation, the case of unknown
standard  deviation  is  the  more  important  and  usual;  in  repetitive  work,  such 
as  production  quality  control  and  standard  tests,  the  standard  deviation 
may  sometimes  be  calculated  closely  from  the  large  amount  of  earlier  data. 

All statistical tests and confidence intervals are based on the assumption
that the samples are random, and most of them are based on the assumption
that the populations have normal distributions. We are rarely certain that
these assumptions are perfectly satisfied. Many large sets of physical and
chemical observations have shown at least approximately normal
distributions, and experimenters may reasonably feel confident that their
variables essentially satisfy the normality assumption on the basis of
previously accumulated data of the same type. The results of statistical
analysis are in error by an unknown amount, depending on the departure of
the actual situation from the assumptions. The assumptions of randomness
and normality can themselves be objectively tested by methods given in
Chapter 4.

What  is  the  importance  of  the  population  mean?  Suppose  we  are 
interested  in  the  weight  of  a  particular  type  of  rocket.  If  we  weigh  several 
of  them,  we  will  probably  obtain  several  slightly  different  weights.  Thus, 
instead  of  a  single  weight,  we  will  have  a  whole  population  of  weights — 
the  weights  of  all  rockets  of  the  particular  type  considered.  In  fact,  if  we 
make  10  weighings  of  the  same  rocket,  we  will  probably  note  some  variation 
due to errors of measurement. Thus, even for one rocket the exact weight
represents only an ideal measurement; actually, there may be a whole
population  of  weights  for  each  rocket — the  weights  that  would  be  recorded 
in  all  possible  weighings. 

A natural answer to the question, "How much does this rocket weigh?"
is the mean of the population of weights that would be recorded in all
possible weighings of that particular rocket. A natural answer to the
question, "How much do rockets of this type weigh?" is the mean of the
population of weights of all such rockets.

Thus,  the  mean  enters  into  any  question  of  measurement,  both  because 
of  measurement  errors  and  because  of  actual  variation  of  the  quantity 
measured.  It  is,  then,  highly  desirable  to  have  objective  methods  for  testing 
whether  an  experimental  mean  value  is  consistent  with  some  theoretical 
value and for stating a confidence interval for the population mean ("true"
mean) on the basis of a sample.

If  it  is  practical  to  take  observations  successively  without  specifying 
beforehand  the  total  sample  size,  the  most  economical  method  for  testing  a 
mean is provided by "sequential analysis." (See Sec. 8.4 and its references.)

To  find  the  mean  for  a  quantity  that  cannot  be  measured  directly,  for 
instance  the  minimum  rocket  velocity  that  will  permit  penetration  of  a 
target, a "staircase" test is often desirable. (See Sec. 4.5 and 4.5.1.)

The τ₁ test in Sec. 2.2.1, the τ_d test in Sec. 2.5.3a, and the sign test in
Sec. 2.5.2a are useful for rapid computation, checking, or preliminary
tests of means.

2.1.  Test for Mean μ When Standard Deviation σ Is Known

2.1.1.  Assumptions

(1) The sample is a random sample.

(2) The population is normally distributed with known standard
deviation σ, or else the sample size is large (> 30), in which case s may
be substituted for σ (see Sec. 3.1).

2.1.2.  Normal  Test 

Given a sample of size n, to test at a given significance level α whether
the population mean μ has the hypothetical value a, compute the statistic

$$ z = \frac{\bar{x} - a}{\sigma/\sqrt{n}} $$

The distribution of z is normal with unit standard deviation, for σ/√n
is the standard deviation of means x̄.

For an equal-tails test of the null hypothesis that μ = a, reject the
hypothesis if z falls in the rejection region |z| > z_{α/2}, where |z| is the
absolute or numerical value of z. The value z_{α/2} is that exceeded with
probability α/2 by the normal variable z if the null hypothesis holds.

For a one-sided test of the null hypothesis that μ = a, against the
alternative that μ > a, reject the hypothesis if z > z_α.

For a one-sided test of the null hypothesis that μ = a, against the
alternative that μ < a, reject the hypothesis if z < -z_α.

From  Appendix  Table  2,  for  a  5%  level  of  significance  these  rejection 
regions  are 

|z| > 1.960 for an equal-tails test of μ = a

z > 1.645 for a one-sided test of μ = a against the alternative μ > a

z < -1.645 for a one-sided test of μ = a against the alternative μ < a

If z falls in the rejection region for the test being made, reject the null
hypothesis that the population mean μ equals a on the grounds that such a
large or small value of z would occur with probability only 5% if the null
hypothesis held.

If the population is finite of size N, rather than infinite, replace the
above statistic z by

$$ z = \frac{\bar{x} - a}{\dfrac{\sigma}{\sqrt{n}} \sqrt{\dfrac{N - n}{N - 1}}} $$

and proceed as before.

Notice that besides the null hypothesis, there are several background
assumptions in force: that σ is known, that the statistic z has a normal
distribution, and that the sampling was random. Rejection of the null
hypothesis may not mean that the hypothesis itself was false; it may mean,
instead, that some of the assumptions underlying the construction of the
mathematical working model were not satisfied.

On the other hand, failure to reject the null hypothesis is not always
strong evidence that it holds. If the sample size is small, only a large
departure from the null hypothesis is likely to be adjudged significant.
Hence, it is important to plan experiments of sufficient size to hold risks
of error to acceptably low levels. Section 2.1.2a describes how to determine
suitable sample sizes for testing means when σ is known. Section 1.1.7
may be consulted for a general discussion of statistical tests.

Example.  The  mean  range  of  a  type  of  rocket  is  2,000  yards,  and  the 
range  standard  deviation  is  120  yards.  Forty  rounds  fired  after  a  year’s 
storage  give  a  mean  range  of  1,863  yards.  Test  at  the  5%  significance 
level  whether  storage  changes  the  mean  range. 

We compute

$$ z = \frac{\bar{x} - a}{\sigma/\sqrt{n}} = \frac{1{,}863 - 2{,}000}{120/\sqrt{40}} = -7.22 $$

For a 5% significance level, the equal-tails rejection region is |z| > 1.960.
Since -7.22 is less than -1.960, it falls in the rejection region, and
we reject the null hypothesis that the mean μ equals 2,000 yards. We
conclude that a year's storage does change the mean range of this type
of rocket.

An  equal-tails  rejection  region  was  used  under  the  assumption  that 
before  the  experiment  was  performed  we  were  interested  in  a  change  in 
range  of  this  rocket  in  either  direction.  If  we  had  been  interested  only  in 
whether  the  range  decreased,  we  would  have  used  the  one-sided  rejection 
region  z  <  -1.645. 
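
The normal test of this section can be sketched in a few lines of Python.
This is only an illustrative sketch under stated assumptions: the function
name z_test and its arguments are hypothetical, and the critical values
are taken from the standard library's NormalDist rather than from Appendix
Table 2. The data are those of the rocket-range example above.

```python
from math import sqrt
from statistics import NormalDist


def z_test(xbar, a, sigma, n, alpha=0.05, alternative="two-sided"):
    """Normal test of mu = a with known sigma. Returns (z, reject)."""
    z = (xbar - a) / (sigma / sqrt(n))
    std = NormalDist()  # standard normal distribution
    if alternative == "two-sided":      # equal-tails test
        reject = abs(z) > std.inv_cdf(1 - alpha / 2)
    elif alternative == "greater":      # alternative mu > a
        reject = z > std.inv_cdf(1 - alpha)
    elif alternative == "less":         # alternative mu < a
        reject = z < -std.inv_cdf(1 - alpha)
    else:
        raise ValueError("alternative must be two-sided, greater, or less")
    return z, reject


# Rocket example: mean 1,863 yards from 40 rounds, a = 2,000, sigma = 120.
z, reject = z_test(1863, 2000, 120, 40)
print(f"z = {z:.2f}, reject null hypothesis: {reject}")
```

Passing alternative="less" applies the one-sided region z < -1.645
discussed in the preceding paragraph.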

a.  Determination of Sample Size. When an experiment is to be performed
and the results are to be analyzed by the method of this section,
a decision must be made about the size of the sample to be used. Suppose
we wish to guard against the possibility of not rejecting the null hypothesis
μ = a if μ actually has a particular alternative value b. Let there be a
specified risk β of making such an error (called a Type II error; see
Sec. 1.1.8). We adopt the level of significance α (the risk of rejecting
the null hypothesis when the mean actually is a; that is, the probability
of a Type I error). In the examples given here, the level α is taken to
be 0.05 and the charts included are applicable for that commonly used
level only.

From curve A, B, or C of Fig. 2.1, depending upon the type of rejection
region used, read off the value of the alternative mean μ = a + kσ/√n
for the given risk β. Since the alternative specified to have risk β was
μ = b, we have

$$ b = a + \frac{k\sigma}{\sqrt{n}} \quad\text{or}\quad n = \left(\frac{k\sigma}{b - a}\right)^2 $$

Using this sample size assures us that the experiment will enable us to
accept the null hypothesis μ = a or the alternative hypothesis μ = b with
satisfactorily small risks of being wrong in either case.

FIG. 2.1.  OC Curves for the Normal Test at the 5% Level of Significance.
(A) Equal-tails test, (B) one-sided test for an alternative b > a, and
(C) one-sided test for an alternative b < a. (Horizontal axis: alternative
mean, μ = a + kσ/√n.)

Example. Suppose that in the example given above in this section
there is a specified risk 0.2 of failing to detect a change of ±60 yards
in the 2,000-yard range. How large a sample size should be used in a
test at the 5% level of significance?

For β = 0.2 and an equal-tails test, curve A of Fig. 2.1 gives the
alternative mean a ± 2.8σ/√n. Setting this equal to the alternative
b = 2,000 ± 60, we have

$$ 2{,}000 \pm 60 = 2{,}000 \pm \frac{2.8(120)}{\sqrt{n}} \quad\text{or}\quad n = 31.4 $$

A sample of 32 rockets is required, showing that the sample of 40
actually used had a lower β risk than 0.2 for the alternatives 1,940
and 2,060 yards.

If, in the same example, we decide before the experiment that we
need be concerned only if the range is decreased by storage, we will adopt
the one-sided critical region z < -1.645. Suppose there is a specified
risk 0.2 of failing to detect a decrease of 60 yards in the 2,000-yard
range. To determine the necessary sample size for a test at the 5% level
of significance, refer to curve C of Fig. 2.1. For β = 0.2, the curve gives
the alternative μ = a - 2.5σ/√n. Setting this equal to the alternative
1,940, we have

$$ 1{,}940 = 2{,}000 - \frac{2.5(120)}{\sqrt{n}} \quad\text{or}\quad n = 25 $$

For this test only 25 rockets need be used.

Appendix Chart V serves the same purpose as the equal-tails curve of
Fig. 2.1, but is easier to use. Compute

$$ \lambda = \frac{|b - a|}{\sigma} $$

and enter the chart with λ and β. Use the sample size indicated on the
next lower curve, or interpolate between curves.

Example. In the example given above

$$ \lambda = \frac{|2{,}060 - 2{,}000|}{120} = 0.5 $$

In Appendix Chart V, the next lower curve gives 40, but 30 is nearly
large enough. Interpolating, we take a sample size of 32 for an equal-tails
test.
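
The multipliers k = 2.8 and k = 2.5 read from Fig. 2.1 agree closely with
the sum of two normal deviates, z_{α/2} + z_β for the equal-tails test and
z_α + z_β for the one-sided test. The sketch below rests on that
assumption (it is a common approximation, not the Manual's chart
procedure, and it ignores the small contribution of the far tail in the
equal-tails case). The function name sample_size_normal_test is
hypothetical.

```python
from math import ceil
from statistics import NormalDist


def sample_size_normal_test(a, b, sigma, alpha=0.05, beta=0.2, two_sided=True):
    """Approximate k and n = (k*sigma/(b - a))**2 for the normal test."""
    std = NormalDist()
    # Deviate for the rejection region (alpha/2 per tail if equal-tails).
    z_alpha = std.inv_cdf(1 - (alpha / 2 if two_sided else alpha))
    # Deviate for the specified Type II risk beta.
    z_beta = std.inv_cdf(1 - beta)
    k = z_alpha + z_beta
    n = (k * sigma / (b - a)) ** 2
    return k, n


# Rocket example: detect a 60-yard change with sigma = 120 yards.
k_two, n_two = sample_size_normal_test(2000, 2060, 120, two_sided=True)
k_one, n_one = sample_size_normal_test(2000, 1940, 120, two_sided=False)
print(f"equal-tails: k = {k_two:.2f}, n = {n_two:.1f}, use {ceil(n_two)}")
print(f"one-sided:   k = {k_one:.2f}, n = {n_one:.1f}, use {ceil(n_one)}")
```

Rounding n up to the next whole number reproduces the sample sizes 32 and
25 obtained in the examples of this section.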


2.1.3.  Confidence  Intervals 

Sets of 100(1 - α)% confidence limits for the population mean μ are

$$ \bar{x} \pm z_{\alpha/2} \frac{\sigma}{\sqrt{n}} \qquad \text{for an infinite population} $$

$$ \bar{x} \pm z_{\alpha/2} \frac{\sigma}{\sqrt{n}} \sqrt{\frac{N - n}{N - 1}} \qquad \text{for a finite population of size } N $$

where σ is the known population standard deviation and z_{α/2} is the value
in the standard normal distribution such that the probability of a random
deviation numerically greater than z_{α/2} is α. (See Sec. 1.1.10 for a general
discussion of confidence intervals.)


Example.  In  an  assessment  process  several  measurements  are  made 
on  each  photographic  plate  to  determine  the  distance  between  two  points. 
For  one  plate  the  average  of  four  measurements  is  1.215  inches.  Assume 
that  the  population  of  measurements  has  a  normal  distribution  with 
standard  deviation  0.01  inch.  What  is  a  95%  confidence  interval  for 
the  distance  between  the  points  ? 

From Appendix Table 2, we find z_{.025} = 1.960. Using the first
formula of this section, we obtain

$$ 1.215 \pm \frac{1.960(0.01)}{\sqrt{4}} = 1.215 \pm 0.0098 $$

This gives the 95% confidence interval of 1.205 to 1.225 inches.


a.  Determination of Sample Size. We can determine the necessary
sample size so that the confidence interval will be of length 2h (giving
confidence limits x̄ ± h). From

$$ 2 z_{\alpha/2} \frac{\sigma}{\sqrt{n}} = 2h $$

we have

$$ n = \left(\frac{z_{\alpha/2}\,\sigma}{h}\right)^2 $$

Example. If in the example above we specify the length of a 95%
confidence interval to be 0.015 inch, how many measurements should
be made on each plate?

We compute

$$ n = \left[\frac{1.960(0.01)}{0.0075}\right]^2 = 6.83 $$

A sample size of seven is required.
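
The confidence limits and the sample-size formula of Sec. 2.1.3 can be
checked numerically. The following is a minimal sketch, assuming the
normal deviate is obtained from Python's NormalDist instead of Appendix
Table 2; the function names normal_ci and n_for_half_length are
hypothetical.

```python
from math import ceil, sqrt
from statistics import NormalDist


def normal_ci(xbar, sigma, n, conf=0.95, N=None):
    """Confidence limits for mu with known sigma (N = finite population size)."""
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    half = z * sigma / sqrt(n)
    if N is not None:
        half *= sqrt((N - n) / (N - 1))  # finite population correction
    return xbar - half, xbar + half


def n_for_half_length(sigma, h, conf=0.95):
    """Unrounded n giving confidence limits xbar +/- h."""
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    return (z * sigma / h) ** 2


# Photographic-plate example: mean of 4 readings 1.215 in., sigma = 0.01 in.
lo, hi = normal_ci(1.215, 0.01, 4)
n_req = n_for_half_length(0.01, 0.0075)
print(f"95% interval: {lo:.3f} to {hi:.3f} inches")
print(f"n = {n_req:.2f}, use {ceil(n_req)}")
```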

2.2.  Test for μ When σ Is Unknown

Two useful tests are given below for this case: the τ₁ test and the t test.
The first is quickly and easily applied, but is restricted to small samples
(2 ≤ n ≤ 10) and should not be used if the population is known, or
suspected, to be non-normal. It is often used for making a preliminary
check on a portion of a larger sample. The t test involves considerably
more computation, but it is the most powerful test available for this case.
It is valid for any sample size, and it is a good approximation, especially
for large samples, even though the population is non-normal to a noticeable
degree.

2.2.1.  The τ₁ Test

To apply a one-sided τ₁ test of the null hypothesis μ = a, against the
alternative μ > a, on the basis of a random sample from a normal
population, compute

$$ \tau_1 = \frac{\bar{x} - a}{w} $$

where w is the range of the sample. Compare τ₁ with the value in Appendix
Table 13(d) for the given sample size and the desired level of significance.
Reject the null hypothesis that μ = a if τ₁ exceeds the tabled value (Ref.
2.2). The equal-tails test is identical except that the sign of τ₁ is ignored
and the critical value used is that for α/2.

2.2.2.  The t Test

For a t test of the null hypothesis μ = a on the basis of a random
sample, compute

$$ t = \frac{\bar{x} - a}{s/\sqrt{n}} $$

where

$$ s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1} = \frac{n \sum x_i^2 - \left(\sum x_i\right)^2}{n(n - 1)} $$

as in the definition of Sec. 1.1.4a. For an equal-tails test, compare the
calculated value with the value t_{α/2,n-1} under P(t) = α/2 in Appendix
Table 3. The subscript n - 1 represents the number of degrees of
freedom f. If |t| > t_{α/2,n-1}, reject the null hypothesis that μ = a.

For a one-sided test of the null hypothesis μ = a, against the alternative
μ > a, use the rejection region t > t_{α,n-1}.

Example.  The  following  readings  represent  the  measurements  of 
outside  diameter  (in  inches)  for  nine  grains  of  the  same  type. 


    2.021    2.002    2.001
    2.005    1.990    1.990
    2.009    1.983    1.987

Assuming  a  normal  distribution,  we  wish  to  test  at  the  5%  significance 
level  whether  the  average  diameter  of  this  type  of  grain  is  2.000  inches. 

To simplify computation we code the data, replacing each x_i by
y_i = 1,000(x_i - 2.000). Then the computations are performed on the
y_i's and

$$ \bar{x} = \frac{\bar{y} + 2{,}000}{1{,}000} \qquad s_x = \frac{s_y}{1{,}000} $$

We now have the following y_i's:

     21      2      1
      5    -10    -10
      9    -17    -13

We compute

$$
\begin{aligned}
\sum y_i &= -12 \\
\bar{y} &= -1.333 \\
\bar{x} &= 2 + (-0.001333) \\
\sum y_i^2 &= 1{,}210 \\
s_y^2 &= \frac{1}{72}\left[(9)(1{,}210) - (-12)^2\right] = 149.25 \\
s_y &= 12.22 \\
s_x &= 0.01222 \\
t &= \frac{\bar{x} - a}{s_x/\sqrt{n}} = \frac{2 + (-0.001333) - 2}{0.01222/3} = -0.327
\end{aligned}
$$

From Appendix Table 3 we find t_{.025,8} = 2.306. Since this is greater
than the absolute value of our computed t, we cannot reject the null
hypothesis that the average diameter of this type of grain is 2.000 inches.

The statistic t can also be computed directly from the coded values:

$$ t = \frac{\bar{y} - 0}{s_y/\sqrt{n}} = \frac{-1.333}{12.22/3} = -0.327 $$

In this example, since 2 ≤ n ≤ 10, the less powerful τ₁ test could
have been used as an alternative procedure.
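
The arithmetic of the grain-diameter example can be verified with a short
Python sketch. It uses only the standard library, so the critical value
2.306 is copied from the text (Appendix Table 3) rather than computed; the
variable names are illustrative assumptions.

```python
from math import sqrt
from statistics import mean, stdev

# Outside diameters (inches) of the nine grains in the example.
diameters = [2.021, 2.002, 2.001, 2.005, 1.990, 1.990, 2.009, 1.983, 1.987]
a = 2.000          # hypothetical mean under the null hypothesis
n = len(diameters)

xbar = mean(diameters)
s = stdev(diameters)              # sample standard deviation, divisor n - 1
t = (xbar - a) / (s / sqrt(n))

T_CRIT = 2.306     # t for alpha/2 = 0.025 with 8 degrees of freedom, from the text
reject = abs(t) > T_CRIT
print(f"xbar = {xbar:.6f}, s = {s:.5f}, t = {t:.3f}, reject: {reject}")
```

Working with the uncoded readings gives the same t as the coded values,
since coding only shifts and rescales the data.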


a.  Determination of Sample Size. An experiment is to be performed
and the results are to be analyzed by the equal-tails t test described above.
As in Sec. 2.1.2a, we can determine the necessary sample size for a t test
to be conducted at the significance level α if there is specified a risk β of
accepting the null hypothesis μ = a when actually μ has the alternative
value b. It is necessary to have an estimate of the standard deviation
(obtained from past experience with similar data).

For a significance level α = 0.05, Appendix Chart VI can be used to
obtain the sample size. Compute

$$ \lambda = \frac{|b - a|}{\sigma} $$

using for σ an estimate of the standard deviation. Opposite λ and β in
Appendix Chart VI, select the sample size n indicated on the next lower
curve, or interpolate between curves.

The absolute (numerical) value |b - a| is used since Appendix
Chart VI is for an equal-tails test.

Example. In the case cited above, find the sample size necessary to
test the hypothesis μ = 2.000 inches at the 5% level of significance.
Assume that σ is thought from past experience to have a value between
0.005 and 0.010 inch, and that a β risk of 0.2 is specified for the error
of accepting the null hypothesis μ = 2.000 when actually μ has an
alternative value of 2.000 ± 0.010.

Using the two extreme values for σ, we obtain

$$ \lambda_1 = \frac{|2.000 - 2.010|}{0.005} = 2 \qquad \lambda_2 = \frac{|2.000 - 2.010|}{0.010} = 1 $$

For λ = 2 and β = 0.2, Appendix Chart VI gives n = 5. For λ = 1
and β = 0.2, the chart gives n = 10. Thus, to be sure of keeping the
probability of a Type II error as small as specified, a sample size of 10
should be used.


Notice that no matter how small a sample we have (provided that n > 1,
so that s can be calculated), we can test the hypothesis μ = a at any
significance level, say α = 0.001 or even less. Thus we make sure of avoiding
the error of rejecting the hypothesis μ = a when it is true; but we are likely
to miss recognizing an alternative situation μ = b when it is true. The only
way to make the risks of both errors sufficiently small is to choose a sample
size that is sufficiently large.


2.2.3.  Confidence  Intervals 

Confidence limits for the mean μ with coefficient 100(1 - α)% are

$$ \bar{x} \pm t_{\alpha/2,\,n-1} \frac{s}{\sqrt{n}} $$

where t_{α/2,n-1} is again the t deviate for f = n - 1 degrees of freedom, the
probability of exceeding which is P(t) = α/2.

Example.  A  recently  developed  fuze  is  tested  for  the  first  time  at 
high  temperatures.  The  coded  mean  of  arming  distances  for  10  fuzes 
is  42  units  of  distance.  The  value  of  s  is  2.1.  Under  the  supposition  that 
the  arming  distances  have  a  normal  distribution,  find  a  95%  confidence 
interval  for  the  mean  arming  distance  of  the  new  fuze. 

From Appendix Table 3 we find t_{.025,9} = 2.262. From x̄ ± t_{α/2,n-1} s/√n
we have 42 ± 2.262(2.1)/√10. The desired confidence interval is
40.5 to 43.5 units of distance.

a.  Determination of Sample Size. If an estimate of σ is available,
the equation

$$ n = \left(\frac{t_{\alpha/2,\,n-1}\,\sigma}{h}\right)^2 $$

can be used to give the sample size necessary to obtain a 100(1 - α)%
confidence interval with expected length 2h. The actual length obtained
from the data will be calculated from the first equation in this section;
therefore it will depend on the s calculated from the sample, varying
randomly above or below 2h to an extent depending on the discrepancy
between σ and s.

Example.  How  many  readings  of  temperature  should  be  taken  if  it 
is  hoped  to  specify  the  temperature  within  ±0.5°  with  99%  confidence? 
From  previous  trials  made  with  the  same  thermometer  and  the  same 
observer,  the  standard  deviation  of  the  readings  is  expected  to  be  in  the 
neighborhood  of  0.6°. 

In this case σ = 0.6° and h = 0.5°. Using Appendix Table 3, we
find by trial and error that for n = 13, (σ t_{.005,n-1}/h)² = 13.44; and for
n = 14, (σ t_{.005,n-1}/h)² = 13.06. This shows that for n = 13 the
expected half-length of the confidence interval will be slightly more than
the prescribed value 0.5°, and for n = 14 it will be slightly less. In such
a situation, the usual practice is to choose the larger sample size, n = 14.
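
Both examples of Sec. 2.2.3 can be reproduced with a short Python sketch.
The standard library has no t distribution, so the deviates are entered
by hand: 2.262 is quoted in the text, while the small table T_005 holds
standard tabulated values of t_{.005,f} that are supplied here as an
assumption (they are not printed in this section). The function names are
hypothetical.

```python
from math import sqrt

# Standard t deviates exceeded with probability 0.005, keyed by degrees of
# freedom f (assumed values from a standard t table, for 99% limits).
T_005 = {10: 3.169, 11: 3.106, 12: 3.055, 13: 3.012, 14: 2.977, 15: 2.947}


def t_ci(xbar, s, n, t_value):
    """Confidence limits xbar +/- t*s/sqrt(n) for a supplied t deviate."""
    half = t_value * s / sqrt(n)
    return xbar - half, xbar + half


def smallest_n(sigma, h, t_table):
    """Trial-and-error search for the least n with n >= (sigma*t/h)**2."""
    for n in sorted(f + 1 for f in t_table):
        required = (sigma * t_table[n - 1] / h) ** 2
        if n >= required:
            return n, required
    raise ValueError("table does not extend far enough")


# Fuze example: mean 42, s = 2.1, n = 10, t(.025, 9) = 2.262 from the text.
lo, hi = t_ci(42, 2.1, 10, 2.262)
# Thermometer example: sigma about 0.6 degree, h = 0.5 degree, 99% confidence.
n, required = smallest_n(0.6, 0.5, T_005)
print(f"95% interval: {lo:.1f} to {hi:.1f}")
print(f"n = {n} (right-hand side {required:.2f})")
```

The search stops at the first n that is at least as large as the
right-hand side of the sample-size equation, which is the "choose the
larger sample size" rule of the example.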

2.3.  Test  for  Proportion  p 

Often the sample items are merely classified as having or not having
a certain characteristic; e.g., they function or fail to function, they succeed
or fail, they are perfect or contain defects. Here the basic distribution is
the binomial (Sec. 1.2.2). We are interested in testing a hypothetical value
p₀ for the population proportion having the characteristic. The unknown
parameter p is estimated from a sample as the ratio r/n of the number r of
items having the characteristic to the sample size n.

2.3.1.  Assumptions 

(1)  A  proportion  p  <  0.5  of  the  items  of  a  population  have  a  particular 
characteristic.  The  artificial  restriction  p  <  0.5,  which  is  a  convenience  in 
constructing  tables,  is  satisfied  by  considering  p  as  the  proportion  not 
having  the  characteristic,  if  the  proportion  having  the  characteristic  is 
greater  than  0.5. 

(2)  The  sample  is  a  random  sample,  each  item  of  which  is  tested  for 
the  characteristic. 

(3)  Ideally,  the  population  is  infinite.  If  the  population  is  finite  (as  in 
a  pilot  lot),  the  methods  described  in  Sec.  2.3.2  give  good  approximations 
for  lot  sizes  down  to  10  times  the  sample  size.  (For  finite  populations,  see 
also  the  end  of  Sec.  2.3.3b.) 

2.3.2.  Test 

a.  Test for a Sample of Size n ≤ 150. To test at the 5% level the
hypothesis that p has the value p₀, against alternative values greater than
p₀, find the smallest whole number r for which

$$ \sum_{s=r}^{n} C_s^n \, p_0^s (1 - p_0)^{n-s} \le 0.05 $$

using one of the tables described in the next paragraph. Reject the null
hypothesis if r or more defectives are observed in the sample. Note that
this is a one-sided test.

The  technique  shown  here  is  correct  for  any  sample  size,  though  its 
practical  use  depends  on  the  availability  of  tables.  Table  2  of  Ref.  2.5  lists 
the  values  of  the  entire  summation  shown  above  for  n  <  50,  for  all  values 
of  r,  and  for  p0  from  0.01  to  0.50  in  steps  of  0.01.  Ref.  2.7  extends  the 
table of Ref. 2.5 from n = 50 to n = 100 in steps of 5, and Ref. 2.1
expands the table to include all n ≤ 150.

Example. A sample of 40 rockets is chosen at random from a large
lot. Design a test for determining at the 5% level of significance
whether the proportion of rockets defective (in that they fail to arm) is
0.01, assuming that we are interested in detecting an alternative
proportion greater than 0.01.

Using Table 2 of Ref. 2.5, we find

$$ \sum_{s=3}^{40} C_s^{40} (0.01)^s (0.99)^{40-s} = 0.007 $$

$$ \sum_{s=2}^{40} C_s^{40} (0.01)^s (0.99)^{40-s} = 0.061 $$

Thus, if the sample contains three or more defectives, the lot should be
rejected as having a proportion defective greater than 0.01.
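
Where the tables of Ref. 2.5, 2.7, and 2.1 are not at hand, the binomial
tail can be summed directly. The sketch below is an illustration, not the
Manual's procedure; the function names upper_tail and critical_count are
hypothetical. It finds the smallest r whose upper-tail probability does
not exceed the significance level.

```python
from math import comb


def upper_tail(n, r, p0):
    """P(r or more defectives in n) when the true proportion is p0."""
    return sum(comb(n, s) * p0**s * (1 - p0) ** (n - s) for s in range(r, n + 1))


def critical_count(n, p0, alpha=0.05):
    """Smallest r with upper_tail(n, r, p0) <= alpha (one-sided test)."""
    for r in range(n + 1):
        if upper_tail(n, r, p0) <= alpha:
            return r
    return n + 1  # no rejection region exists at this level


# Rocket example: n = 40, hypothetical proportion defective p0 = 0.01.
tail3 = upper_tail(40, 3, 0.01)
tail2 = upper_tail(40, 2, 0.01)
r_crit = critical_count(40, 0.01)
print(f"P(3 or more) = {tail3:.4f}, P(2 or more) = {tail2:.4f}")
print(f"reject if {r_crit} or more defectives are observed")
```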


b.  Test for a Sample of Size n > 5/p₀. To test at the significance
level α the hypothesis that p = p₀ against the alternative that p > p₀,
compute

$$ z = \frac{r - np_0 - \tfrac{1}{2}}{\sqrt{np_0(1 - p_0)}} $$

where r is the number of defectives observed in a sample of n. Compare z
with z_α (from Appendix Table 2 with F(z) = 1 - α) and reject the
hypothesis if z > z_α.

To test the null hypothesis p = p₀ against the alternative that p < p₀,
compute

$$ z = \frac{r - np_0 + \tfrac{1}{2}}{\sqrt{np_0(1 - p_0)}} $$

and reject the null hypothesis if z < -z_α.

For an equal-tails test of the null hypothesis against alternatives greater
or less than p₀, compute

$$ z = \frac{|r - np_0| - \tfrac{1}{2}}{\sqrt{np_0(1 - p_0)}} $$

and reject the null hypothesis if z > z_{α/2}.

These tests are based on a normal approximation that is sufficiently good
for n > 5/p₀. For a sequential test for a proportion p, see Sec. 8.4.1.

c.  Determination of Sample Size. The method for determining the
required sample size will not be taken up explicitly for the testing of
hypotheses. It is convenient to use the sample size n necessary to give a
confidence interval of given length (Sec. 2.3.3a) for the significance test
as well.
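
The three statistics of Sec. 2.3.2b differ only in how the correction of
one-half is applied. A minimal Python sketch follows; the function name
proportion_z and the numerical case (17 defectives in 100 items with
p₀ = 0.10) are hypothetical and are not taken from the Manual.

```python
from math import sqrt
from statistics import NormalDist


def proportion_z(r, n, p0, alternative="greater"):
    """Normal-approximation statistic for testing p = p0 (needs n > 5/p0)."""
    if n <= 5 / p0:
        raise ValueError("normal approximation requires n > 5/p0")
    sd = sqrt(n * p0 * (1 - p0))
    d = r - n * p0
    if alternative == "greater":      # reject if z > z_alpha
        return (d - 0.5) / sd
    if alternative == "less":         # reject if z < -z_alpha
        return (d + 0.5) / sd
    if alternative == "two-sided":    # reject if z > z_{alpha/2}
        return (abs(d) - 0.5) / sd
    raise ValueError("alternative must be greater, less, or two-sided")


# Hypothetical case: 17 defectives in a sample of 100, p0 = 0.10, alpha = 0.05.
alpha = 0.05
z = proportion_z(17, 100, 0.10)
reject = z > NormalDist().inv_cdf(1 - alpha)
print(f"z = {z:.2f}, reject null hypothesis: {reject}")
```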

2.3.3.  Confidence Intervals

a.  Graphical and Tabular Solutions. Appendix Charts II, III, and IV
give two-sided 90, 95, and 99% confidence limits, respectively, for p as
a function of the proportion defective in the sample for a number of sample
sizes. For n ≤ 30, Appendix Table 21(a) should be used. The confidence
coefficient is in each case at least as high as indicated; it cannot be made
exact because of the discrete nature of the distribution. The charts can also
be used to determine roughly the necessary sample size for a confidence
interval of specified length. (See the second example below.)

For  one-sided  confidence  intervals,  use  Appendix  Table  21  (b).  Appendix 
Charts  II,  III,  and  IV  can  be  used  for  one-sided  intervals  with  confidence 
coefficients  of  95,  97.5,  and  99.5%,  respectively,  by  referring  to  only  one  of 
a  symmetrical  pair  of  curves. 

Example.  The  proportion  defective  in  a  random  sample  of  250 
items  checked  on  a  go-no-go  gage  is  12%.  What  are  95%  confidence 
limits  for  the  proportion  defective  for  total  production? 

The desired confidence limits are marked off by the two curves for
n = 250 in Appendix Chart III at r/n = 0.12. We obtain a confidence
interval of 8 to 17%.

Example.  In  estimating  the  proportion  defective,  the  maximum 
allowable  length  for  a  95%  confidence  interval  is  0.10.  What  sample 
size  should  be  used  ? 

From Appendix Chart III, we find that the 95% confidence interval
has maximum length 0.14 for n = 250, and 0.06 for n = 1,000. Linear
interpolation with respect to 1/n should be used.

        n      1,000/n    Maximum length
      250        4             0.14
      400        2.5           0.10
    1,000        1             0.06

$$ 4 - \frac{0.04}{0.08}(3) = 2.5 $$

We find, then, n = 1,000/2.5 = 400.

It  is  perhaps  surprising  to  find  that  such  a  large  sample  size  is  required 
to  locate  a  proportion  with  an  interval  of  length  not  more  than  0.10 
with  95%  confidence;  however,  the  significance  of  a  proportion  from  a 
small  sample  is  often  overrated.  For  example,  if  we  test  a  sample  of  10 
new missiles and all 10 fly properly, we may feel confident of the missile,
but the 95% confidence limits for the proportion defective are 0
and  0.267. 


b.  Numerical Solution. For a large sample, 100(1 - α)% confidence
limits for p are given by

$$ \frac{r + z_{\alpha/2}^2/2 \pm \sqrt{\left(r + z_{\alpha/2}^2/2\right)^2 - \left(n + z_{\alpha/2}^2\right) r^2/n}}{n + z_{\alpha/2}^2} $$

where r is the number of defectives observed and z_{α/2} is the normal deviate
exceeded with probability α/2. In case the population is finite of size N,
replace z²_{α/2} by z²_{α/2}(N - n)/(N - 1).
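
The large-sample limits can be evaluated directly. The sketch below is an
illustration under stated assumptions: the function name proportion_ci is
hypothetical, and the normal deviate comes from Python's NormalDist rather
than Appendix Table 2. It is applied to the go-no-go gage example of
Sec. 2.3.3a (12% defective in 250 items, that is, r = 30), for which the
chart reading was 8 to 17%.

```python
from math import sqrt
from statistics import NormalDist


def proportion_ci(r, n, conf=0.95, N=None):
    """Large-sample confidence limits for p from r defectives in n items."""
    z2 = NormalDist().inv_cdf(1 - (1 - conf) / 2) ** 2
    if N is not None:
        z2 *= (N - n) / (N - 1)   # finite population of size N
    centre = r + z2 / 2
    root = sqrt(centre**2 - (n + z2) * r**2 / n)
    return (centre - root) / (n + z2), (centre + root) / (n + z2)


lo, hi = proportion_ci(30, 250)
print(f"95% limits for p: {lo:.3f} to {hi:.3f}")
```

The numerical limits fall close to the values read from Appendix
Chart III, as would be expected for a sample of this size.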


2.4.  Test for μ₁ - μ₂ When σ₁ and σ₂ Are Known

2.4.1.  Test  Conditions  and  Assumptions 

A random sample is drawn from each of two populations to determine
whether the difference between the two population means μ₁ and μ₂ is
equal to d. The two samples may be independent, or the observations may
be made in pairs, one from each population, extraneous variables being
held constant for each pair. Let y denote the difference x₁ - x₂ of the
paired observations. If the standard deviation σ_y resulting from such
pairing is known and is less than √(σ₁² + σ₂²), then it is desirable to pair
during sampling. This situation will arise in general if the paired
observations are positively correlated. (See Sec. 2.5 for discussion of
choice of method if population standard deviations are unknown.)

Each population is assumed to be normally distributed with known
standard deviation σ_i, or else the sample drawn from it is large (> 30), in
which case s_i may be substituted for σ_i (see Sec. 3.1).

2.4.2.  Normal  Test 

a.  Test for Paired Observations. Subtract the second reading of
each pair from the first and consider the resulting differences as sample
values of a new variable y. Test the null hypothesis that μ_y = d by the
method of Sec. 2.1 if σ_y is known, or by the method of Sec. 2.2 if σ_y is
unknown. (It is known from theory that μ_y = μ₁ - μ₂.) Confidence
intervals for μ_y and determinations of sample size can also be obtained by
the methods of Sec. 2.1 and 2.2.

b.  Test for Two Independent Samples. To test at the significance
level α the hypothesis that μ₁ - μ₂ = d, compute

$$ z = \frac{\bar{x}_1 - \bar{x}_2 - d}{\sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}} $$

As in Sec. 2.1.2, reject the null hypothesis that μ₁ - μ₂ = d if z falls in
the rejection region |z| > z_{α/2} for an equal-tails test or z > z_α for a
one-sided test against alternatives greater than d.

If the two populations are finite of sizes N₁ and N₂, replace σ₁² by
σ₁²(N₁ - n₁)/(N₁ - 1) and σ₂² by σ₂²(N₂ - n₂)/(N₂ - 1) in the formula
for z.

Example. A standard method of chemical analysis for determining
nitrocellulose in propellants is known, from the large number of previous
analyses, to have a standard deviation $\sigma = 0.6$. Each of two new
propellants, assumed homogeneous, was subjected to five analyses. One
result was lost because of an accident; the other nine results were:

$x_{1j}$: 63.12, 63.57, 62.81, 64.32, 63.76%
$x_{2j}$: 62.54, 63.21, 62.38, 62.06%

We wish to test at the 5% significance level whether there is any real
difference in nitrocellulose content.

The hypothesis to be tested is that $\mu_1 = \mu_2$, or $\mu_1 - \mu_2 = 0$.
We compute

$$\bar{x}_1 = 62 + \frac{7.58}{5} = 63.516$$

$$\bar{x}_2 = 62 + \frac{2.19}{4} = 62.548$$

Then

$$z = \frac{0.968}{0.6\sqrt{\frac{1}{5} + \frac{1}{4}}} = 2.405$$

From Appendix Table 2, for $F(z) = 1 - \alpha/2$ we find $z_{.025} = 1.960$.
Since 2.405 is greater than 1.960, we reject the null hypothesis that
$\mu_1 = \mu_2$ at the 5% level of significance.
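
The arithmetic of this example can be checked with a short script. This is a minimal sketch, not part of the Manual's procedure: the function name `two_sample_z` is illustrative, and the critical value is taken from Python's `statistics.NormalDist` in place of Appendix Table 2. Because the script carries full precision rather than the rounded means, the computed z may differ from the hand value in the third decimal place.

```python
from math import sqrt
from statistics import NormalDist, mean


def two_sample_z(x1, x2, sigma1, sigma2, d=0.0):
    """z statistic for H0: mu1 - mu2 = d when both sigmas are known."""
    standard_error = sqrt(sigma1**2 / len(x1) + sigma2**2 / len(x2))
    return (mean(x1) - mean(x2) - d) / standard_error


# Nitrocellulose readings from the example (percent).
x1 = [63.12, 63.57, 62.81, 64.32, 63.76]
x2 = [62.54, 63.21, 62.38, 62.06]

z = two_sample_z(x1, x2, sigma1=0.6, sigma2=0.6)
z_crit = NormalDist().inv_cdf(1 - 0.05 / 2)  # equal-tails test at the 5% level
reject = abs(z) > z_crit
print(f"z = {z:.3f}, critical value = {z_crit:.3f}, reject H0: {reject}")
```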

c. Determination of Size of Independent Samples. To make most
effective use of a fixed number $N$ of items allocated for sampling, choose

$$n_1 = \frac{\sigma_1}{\sigma_1 + \sigma_2} N \quad \text{and} \quad n_2 = N - n_1$$

If $\sigma_1 = \sigma_2 = \sigma$, choose $n_1 = n_2 = N/2$. To determine the
common sample size $n$ for an equal-tails test at the 5% level of
significance with an assigned risk $\beta$ of accepting the hypothesis that
$\mu_1 - \mu_2 = d$ when it really has an alternative value $b$, compute

$$\lambda = \frac{|d - b|}{\sigma\sqrt{2}}$$

and refer to Appendix Chart V.

Example. A delayed-action fuze in present use has a standard deviation
in delay time of 0.004 seconds. An experiment is to be performed
to determine whether a new experimental fuze has the same mean delay
time. The test is designed to detect at the 5% significance level either
an increase or a decrease. The standard deviation for the two fuzes is
thought to be the same. How large a sample of each should be used, if
there is an assigned risk of 0.05 of deciding that $\mu_1 - \mu_2 = 0$ when
$|\mu_1 - \mu_2|$ has the alternative value of 0.005?

Here $d = 0$, $b = 0.005$, and $\beta = 0.05$. To detect a change in delay time
in either direction, an equal-tails test will be used. Since
$\sigma_1 = \sigma_2 = 0.004$, we choose $n_1 = n_2 = n$. We have

$$\lambda = \frac{|0.005 - 0|}{(0.004)\sqrt{2}} = 0.88$$

From Appendix Chart V for $\lambda = 0.88$, $\alpha = 5\%$, and $\beta = 5\%$, by
interpolating between curves we obtain the sample size $n = 18$. Thus, 18 fuzes
of each type should be tested.
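
As a rough cross-check on a chart reading, the common sample size can be approximated in closed form. The sketch below assumes the usual normal-theory approximation $n \approx [(z_{\alpha/2} + z_\beta)/\lambda]^2$, which neglects the very small chance of rejecting in the wrong tail. It is not the method by which the chart was drawn, the function names are illustrative, and the result need not agree exactly with a value interpolated from Appendix Chart V.

```python
from math import ceil, sqrt
from statistics import NormalDist


def lam(d, b, sigma):
    """lambda = |d - b| / (sigma * sqrt(2)) for two samples with a common sigma."""
    return abs(d - b) / (sigma * sqrt(2))


def approx_n_per_sample(lmbda, alpha=0.05, beta=0.05):
    """Normal-theory approximation to the common sample size (equal-tails test)."""
    std_normal = NormalDist()
    z_alpha_half = std_normal.inv_cdf(1 - alpha / 2)
    z_beta = std_normal.inv_cdf(1 - beta)
    return ceil(((z_alpha_half + z_beta) / lmbda) ** 2)


lam_fuze = lam(d=0.0, b=0.005, sigma=0.004)
n_approx = approx_n_per_sample(lam_fuze, alpha=0.05, beta=0.05)
print(f"lambda = {lam_fuze:.3f}, approximate n per sample = {n_approx}")
```

For the fuze example the approximation lands within one item of the chart reading, which is about as close as interpolation between curves can be expected to come.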


2.4.3.  Confidence  Intervals 

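A set of $100(1 - \alpha)$% confidence limits for $\mu_1 - \mu_2$ for
independent samples is

$$(\bar{x}_1 - \bar{x}_2) \pm z_{\alpha/2} \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$

a. Determination of Size of Independent Samples. If the length
$2 z_{\alpha/2} [(\sigma_1^2/n_1) + (\sigma_2^2/n_2)]^{1/2}$ of the
$100(1 - \alpha)$% confidence interval for $\mu_1 - \mu_2$ is specified as
$2h$, the total sample size should be

$$N = n_1 + n_2 = \left(\frac{z_{\alpha/2}}{h}\right)^2 (\sigma_1 + \sigma_2)^2$$

giving

$$n_1 = \left(\frac{z_{\alpha/2}}{h}\right)^2 \sigma_1 (\sigma_1 + \sigma_2) \quad \text{and} \quad n_2 = \left(\frac{z_{\alpha/2}}{h}\right)^2 \sigma_2 (\sigma_1 + \sigma_2)$$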

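The sample-size rule for a confidence interval of specified half-length can be sketched as follows. The numerical inputs are hypothetical, chosen only for illustration, and the function names are not from the Manual. In practice the computed sizes would be rounded up to whole numbers.

```python
from math import sqrt
from statistics import NormalDist


def sizes_for_half_length(h, sigma1, sigma2, alpha=0.05):
    """Sample sizes n1, n2 giving a 100(1 - alpha)% interval of half-length h."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    k = (z / h) ** 2
    n1 = k * sigma1 * (sigma1 + sigma2)
    n2 = k * sigma2 * (sigma1 + sigma2)
    return n1, n2


def half_length(n1, n2, sigma1, sigma2, alpha=0.05):
    """Half-length of the interval for mu1 - mu2 with known sigmas."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return z * sqrt(sigma1**2 / n1 + sigma2**2 / n2)


# Hypothetical inputs: sigma1 = 1.0, sigma2 = 2.0, desired half-length h = 0.5.
n1, n2 = sizes_for_half_length(h=0.5, sigma1=1.0, sigma2=2.0)
achieved = half_length(n1, n2, sigma1=1.0, sigma2=2.0)
print(f"n1 = {n1:.1f}, n2 = {n2:.1f}, achieved half-length = {achieved:.3f}")
```

Note that the sizes come out in the ratio $\sigma_1 : \sigma_2$, in agreement with the allocation rule of Sec. 2.4.2c.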

2.5. Test for $\mu_1 - \mu_2$ When $\sigma_1$ and $\sigma_2$ Are Unknown

2.5.1.  Assumptions  and  Design  of  Experiment 

The  assumptions  vary  with  the  different  situations  and  tests  given  below. 
Random  samples  are  necessary,  as  usual.  Normal  distributions  are  assumed 
except  in  the  sign  test.  Using  the  common  normality  assumption,  we  must 
decide  whether  to  sample  the  two  populations  independently,  or  to  pair 
observations  so  that  conditions  which  usually  fluctuate  are  the  same  for  each 
pair, though changing from pair to pair. Thus, in a durability test of paints,
panels of two or more paints would be exposed side by side and would thus
give  paired  rather  than  independent  observations.  If  the  conditions  that  are 
rendered  the  same  by  pairing  are  known  to  influence  substantially  the 
characteristic  x  under  observation,  then  it  is  probably  desirable  to  pair  obser¬ 
vations.  The  advantage  of  independent  samples  lies  in  the  doubling  of  the 
number  of  degrees  of  freedom  with  no  increase  in  the  number  of  obser¬ 
vations,  but  this  advantage  may  or  may  not  be  overbalanced  by  the  reduction 
in  variance  afforded  by  pairing;  each  problem  must  be  examined  carefully. 
Pairing has the additional advantage of eliminating any assumption about
the equality of $\sigma_1$ and $\sigma_2$.

2.5.2.  Test  for  Paired  Observations 

a.  Sign  Test.  (See  Ref.  2.2.)  The  sign  test  provides  a  quick  and  simple 
way  of  determining  whether  a  set  of  paired  readings  shows  a  significant 
mean  difference  between  pair  members.  Unlike  the  t  test  described  below, 
this  test  does  not  depend  on  an  assumption  of  normality;  however,  when 
normality  assumptions  are  valid,  it  is  not  as  efficient  as  the  t  test.  For  the 
sign  test,  it  is  unnecessary  that  the  different  pairs  be  observed  under  the  same 
conditions,  so  long  as  the  two  members  of  each  pair  are  alike.  As  used  here, 
it  is  an  equal-tails  test,  but  it  can  be  adapted  for  use  as  a  one-sided  test. 

Example.  The  first  two  lines  of  the  tabulation  below  represent 
the  deflection  in  mils  of  rockets  fired  in  pairs  simultaneously  from  two 
launchers,  A  and  B.  Can  we  say  that  there  is  a  difference  between  the 
performances  of  the  two  launchers  at  the  5%  significance  level? 

To obtain the signs shown, we record a "plus" whenever the A row
shows a higher value than the B row, a "minus" when it is lower, and
a zero when it is the same.

A ....   4.4  -1.4   3.2   0.2  -5.0   0.3   1.2   2.2   1.3  -0.7
B ....   3.2   7.7   6.4   2.7   3.1   0.6   2.6   2.2   2.2   0.9
Sign .    +     -     -     -     -     -     -     0     -     -

We note that there are fewer +'s than -'s, that the total number $N$ of
+'s and -'s is 9, and that the number of +'s is 1. The zero is omitted
from all counts. In Appendix Table 9 opposite 9 under $\alpha = .05$, we
find the number 1. Since the number of + signs is less than or equal to
the tabled value 1, we decide that there is a significant difference between
the performances of the two launchers at the 5% level.

b. The t Test on Differences. The $t$ test, which depends on an assumption
of normality and requires that all pairs be observed under the same
conditions, gives more information than the sign test. Subtract the pair
values and consider the resulting differences as the data to be analyzed by
the method of Sec. 2.2.2.
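
Both paired-observation procedures can be sketched on the launcher data of the sign-test example. This is an illustrative sketch under stated assumptions: the exact binomial tail probability stands in for the Appendix Table 9 lookup, and the $t$ statistic tests a mean difference of zero, which is one reading of "the method of Sec. 2.2.2". The variable names are not from the Manual.

```python
from math import comb, sqrt
from statistics import mean, stdev

# Deflections in mils for launchers A and B (paired firings).
A = [4.4, -1.4, 3.2, 0.2, -5.0, 0.3, 1.2, 2.2, 1.3, -0.7]
B = [3.2, 7.7, 6.4, 2.7, 3.1, 0.6, 2.6, 2.2, 2.2, 0.9]
diffs = [a - b for a, b in zip(A, B)]

# Sign test: zeros are dropped; r is the count of the less frequent sign.
plus = sum(d > 0 for d in diffs)
minus = sum(d < 0 for d in diffs)
n = plus + minus
r = min(plus, minus)
# Equal-tails probability of a split at least this lopsided when + and - are
# equally likely (assumed equivalent to the tabled critical-value comparison).
p_two_sided = min(1.0, 2 * sum(comb(n, k) for k in range(r + 1)) / 2**n)

# t statistic on the differences for H0: mean difference = 0, with
# len(diffs) - 1 degrees of freedom.
t = mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))

print(f"+: {plus}, -: {minus}, N = {n}, sign-test p = {p_two_sided:.4f}")
print(f"t on differences = {t:.3f} with {len(diffs) - 1} degrees of freedom")
```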

2.5.3. Test for Two Independent Samples When $\sigma_1 = \sigma_2$

a. The $\tau_d$ Test. For small samples ($n_1 = n_2 = n < 10$) or for rapid
pilot computations using part of a large sample, compute

$$\tau_d = \frac{\bar{x}_1 - \bar{x}_2}{\tfrac{1}{2}(w_1 + w_2)}$$

where the $w$'s are the respective ranges. Compare $\tau_d$ with Appendix
Table 13(b) and reject at the significance level $\alpha$ the hypothesis that the
means are equal if $\tau_d$ exceeds the value tabled under $\alpha$. (See Ref. 2.2.)
This is a one-sided test, but the equal-tails test is identical except that the
sign of $\tau_d$ is ignored and the critical value used is that for $\alpha/2$.

b. The t Test. In general, to test the hypothesis that $\mu_1 - \mu_2 = d$,
compute

$$t = \frac{\bar{x}_1 - \bar{x}_2 - d}{s_p \sqrt{\frac{1}{n_1} + \frac{1}{n_2}}}$$

where

$$s_p^2 = \frac{\sum_{j=1}^{n_1} (x_{1j} - \bar{x}_1)^2 + \sum_{j=1}^{n_2} (x_{2j} - \bar{x}_2)^2}{n_1 + n_2 - 2}$$

is a "pooled" estimate (see Sec. 3.1). Compare $t$ with the value
$t_{\alpha/2,n_1+n_2-2}$ in Appendix Table 3, and reject the hypothesis if $|t|$
exceeds the tabled value. As in Sec. 2.2.2 for an equal-tails test, the value
of $t_{\alpha/2,n_1+n_2-2}$ is the $t$ value for $f = n_1 + n_2 - 2$ degrees of
freedom such that the probability is $\alpha/2$ of a random reading greater
than $t_{\alpha/2,n_1+n_2-2}$ and $\alpha/2$ of a reading less than
$-t_{\alpha/2,n_1+n_2-2}$. For a one-sided test, compare $t$ with
$t_{\alpha,n_1+n_2-2}$.

To determine sample size for an equal-tails test at the 5% significance
level with an assigned risk $\beta$ of accepting the hypothesis
$\mu_1 - \mu_2 = d$ when $\mu_1 - \mu_2$ actually has the alternative value $b$,
compute

$$\lambda = \frac{|d - b|}{\sigma\sqrt{2}}$$

For $\sigma$, use an estimate of the common standard deviation of the two
populations. Refer to Appendix Chart VI for the common sample size
$n_1 = n_2 = N/2$.

Confidence limits for $\mu_1 - \mu_2$ with confidence coefficient $1 - \alpha$ are

$$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2,\,n_1+n_2-2}\; s_p \left(\frac{1}{n_1} + \frac{1}{n_2}\right)^{1/2}$$


Example. The variability of two cutting machines is assumed to be
the same. One is set to cut pieces 3.00 mm shorter than the other to allow
for insertion of a compressible spacer. There is assigned a risk of 0.05
of accepting the hypothesis that the mean difference is 3.00 mm when it
has an alternative value 3.00 ± 1.00 mm. If the standard deviation in
both cases is of the order of magnitude of 0.5 mm, what sample size
should be used for a test at the 5% level of significance?

Taking $n_1 = n_2 = n$, we have

$$\lambda = \frac{|d - b|}{\sigma\sqrt{2}} = \frac{|1.00|}{0.5\sqrt{2}} = 1.41$$


From Appendix Chart VI for an equal-tails test with $\lambda = 1.41$ and
$\beta = 0.05$, we read $n = 10$ for each machine.

Suppose that when the experiment is run, the readings in mm are
as shown in Table 2.1.

TABLE 2.1. Length in mm of Pieces Produced
by Two Cutting Machines

Machine 1, $x_{1j}$     Machine 2, $x_{2j}$
      98.06                   94.73
      98.07                   95.02
      98.93                   94.36
      98.03                   96.16
      98.50                   94.50
      98.60                   94.82
      99.67                   93.75
      98.51                   93.90
      98.06                   95.31
      99.01                   95.21

After coding the readings by replacing $x_{1j}$ by
$y_{1j} = 100(x_{1j} - 98)$ and $x_{2j}$ by $y_{2j} = 100(x_{2j} - 93)$, the
data will appear as shown in Table 2.2.


TABLE 2.2. Coded Lengths of Pieces
From Two Cutting Machines

$y_{1j}$     $y_{2j}$
     6         173
     7         202
    93         136
     3         316
    50         150
    60         182
   167          75
    51          90
     6         231
   101         221

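We compute

$$\sum y_{1j} = 544, \qquad \bar{x}_1 = 98 + 0.544 = 98.544$$

$$\sum y_{1j}^2 = 55,570$$

$$\sum (y_{1j} - \bar{y}_1)^2 = \sum y_{1j}^2 - \frac{1}{10}\left(\sum y_{1j}\right)^2 = 25,976.4$$

$$\sum y_{2j} = 1,776, \qquad \bar{x}_2 = 93 + 1.776 = 94.776$$

$$\sum y_{2j}^2 = 360,636$$

$$\sum (y_{2j} - \bar{y}_2)^2 = \sum y_{2j}^2 - \frac{1}{10}\left(\sum y_{2j}\right)^2 = 45,218.4$$

$$s_{py}^2 = \frac{25,976.4 + 45,218.4}{18} = 3,955.27$$

$$s_{py} = 62.891, \qquad s_{px} = 0.6289$$

$$t = \frac{98.544 - 94.776 - 3}{0.6289\left(\frac{1}{10} + \frac{1}{10}\right)^{1/2}} = 2.730$$

$$t_{.025,18} = 2.101$$

Since 2.730 exceeds 2.101, we reject the null hypothesis that the mean
difference between the lengths cut by Machine 1 and those cut by
Machine 2 is 3 mm.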
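
The pooled $t$ computation for the cutting-machine data can be reproduced directly from the uncoded readings. The sketch below is illustrative only: the function name `pooled_t` is an assumption, and the critical value is simply the tabled figure quoted in the example rather than one computed by the script.

```python
from math import sqrt
from statistics import mean


def pooled_t(x1, x2, d=0.0):
    """Return (t, s_p) for H0: mu1 - mu2 = d, assuming equal population variances."""
    n1, n2 = len(x1), len(x2)
    m1, m2 = mean(x1), mean(x2)
    ss1 = sum((x - m1) ** 2 for x in x1)  # squared deviations, sample 1
    ss2 = sum((x - m2) ** 2 for x in x2)  # squared deviations, sample 2
    s_p = sqrt((ss1 + ss2) / (n1 + n2 - 2))
    t = (m1 - m2 - d) / (s_p * sqrt(1 / n1 + 1 / n2))
    return t, s_p


machine_1 = [98.06, 98.07, 98.93, 98.03, 98.50, 98.60, 99.67, 98.51, 98.06, 99.01]
machine_2 = [94.73, 95.02, 94.36, 96.16, 94.50, 94.82, 93.75, 93.90, 95.31, 95.21]

t, s_p = pooled_t(machine_1, machine_2, d=3.0)
T_CRIT = 2.101  # t_{.025,18} as quoted from Appendix Table 3
reject = abs(t) > T_CRIT
print(f"s_p = {s_p:.4f}, t = {t:.3f}, reject H0: {reject}")
```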


2.5.4. Test of $\mu_1 - \mu_2 = 0$ for Two Independent
Samples When $\sigma_1 \neq \sigma_2$


For a general treatment of this subject, see Ref. 2.6, p. 91.

Label the samples so that $n_1 \le n_2$. Use the readings in the order of
observation or else randomize them by lottery choice. It is best in designing
such an experiment to make the two sample sizes equal, since the analysis
makes little use of extra sample items from either population.

Define

$$u_i = x_{1i} - x_{2i}\sqrt{n_1/n_2} \qquad (i = 1, 2, \ldots, n_1)$$

$$Q = n_1 \sum_{i=1}^{n_1} (u_i - \bar{u})^2 = n_1 \sum_{i=1}^{n_1} u_i^2 - \left(\sum_{i=1}^{n_1} u_i\right)^2$$

Then compute

$$t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\dfrac{Q}{n_1^2 (n_1 - 1)}}}$$

where

$$\bar{x}_1 = \frac{1}{n_1}\sum_{i=1}^{n_1} x_{1i}, \qquad \bar{x}_2 = \frac{1}{n_2}\sum_{i=1}^{n_2} x_{2i}$$

Reject the null hypothesis that $\mu_1 = \mu_2$ at the significance level
$\alpha$ if $t$ falls in the rejection region $|t| > t_{\alpha/2,n_1-1}$ for an
equal-tails test or the region $t > t_{\alpha,n_1-1}$ for a one-sided test.
Appendix Table 3 gives percentage points for the $t$ distribution.

Example.  Range  data  are  given  in  Table  2.3  for  two  types  of  rockets. 
The  true  (population)  range  dispersions  are  not  expected  to  be  equal. 


TABLE 2.3. Range Data for Two Types of Rockets

Type II,          Type I,           $x_{2i}\sqrt{n_1/n_2}$   $u_i = x_{1i} - x_{2i}\sqrt{n_1/n_2}$
$x_{2i}$, yd      $x_{1i}$, yd
   1,472             1,747             1,343.7                   403.3
   1,315             1,618             1,200.4                   417.6
   1,984             1,521             1,811.1                  -290.1
   1,900             1,137             1,734.5                  -597.5
   1,646             1,374             1,502.6                  -128.6
   1,709             1,325             1,560.1                  -235.1
   1,780             1,821             1,624.9                   196.1
   1,571             2,351             1,434.1                   916.9
   1,681             1,883             1,534.5                   348.5
   1,453             1,613             1,326.4                   286.6
   1,799             1,843             1,642.3                   200.7
   1,372             1,796             1,252.5                   543.5
   1,420             1,507             1,296.3                   210.7
   1,612             1,387             1,471.5                   -84.5
   1,546               725             1,411.3                  -686.3
   1,255             1,041             1,145.7                  -104.7
   1,818             1,652             1,659.6                    -7.6
   1,357             1,595             1,238.8                   356.2
   1,412             1,679             1,289.0                   390.0
   1,775             1,557             1,620.3                   -63.3
   1,850             1,206             1,688.8                  -482.8
   1,417               192             1,293.5                -1,101.5
   2,057             1,025             1,877.8                  -852.8
   1,468               813             1,340.1                  -527.1
   1,694               971             1,546.4                  -575.4
   1,776              ....
   1,456
   1,489
   1,618
   1,544              ....

Test the mean ranges of the two types of rockets for equality at the 5%
significance level.

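We compute

$$\bar{x}_2 = 1,608.2 \qquad \bar{x}_1 = 1,415.2$$

$$n_2 = 30 \qquad n_1 = 25$$

$$\sum u_i = -1,467.2 \qquad \sum u_i^2 = 5,867,573.36$$

$$Q = 25(5,867,573.36) - (-1,467.2)^2 = 144,536,658.16$$

$$\frac{Q}{n_1^2 (n_1 - 1)} = 9,635.78$$

$$\left[\frac{Q}{n_1^2 (n_1 - 1)}\right]^{1/2} = 98.16$$

$$t = \frac{\bar{x}_1 - \bar{x}_2}{\left[\dfrac{Q}{n_1^2 (n_1 - 1)}\right]^{1/2}} = \frac{-193.0}{98.16} = -1.966$$

From Appendix Table 3, $t_{\alpha/2,n_1-1} = t_{.025,24} = 2.064$. Since 1.966
does not exceed 2.064, the hypothesis that the two means are equal cannot be
rejected at the 5% significance level.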
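
The computation for unequal variances can be sketched as below. Two assumptions should be noted: the function name is illustrative, and the first four Type I readings (1,747; 1,618; 1,521; 1,137) are back-computed from the tabulated values of $u_i$ and $x_{2i}\sqrt{n_1/n_2}$ rather than read directly from the table. Because the script does not round the $u_i$ to one decimal place, its result may differ slightly from the hand computation.

```python
from math import sqrt
from statistics import mean


def unequal_variance_t(x1, x2):
    """t with n1 - 1 degrees of freedom; x1 must be the smaller sample (n1 <= n2).

    Readings are paired in the order given: u_i = x1[i] - x2[i] * sqrt(n1 / n2).
    """
    n1, n2 = len(x1), len(x2)
    if n1 > n2:
        raise ValueError("label the samples so that n1 <= n2")
    c = sqrt(n1 / n2)
    u = [a - c * b for a, b in zip(x1, x2)]  # zip stops after the first n1 pairs
    q = n1 * sum(v * v for v in u) - sum(u) ** 2
    return (mean(x1) - mean(x2)) / sqrt(q / (n1**2 * (n1 - 1)))


# Type I ranges (n1 = 25); the first four values are reconstructed (see above).
type_1 = [1747, 1618, 1521, 1137, 1374, 1325, 1821, 2351, 1883, 1613,
          1843, 1796, 1507, 1387, 725, 1041, 1652, 1595, 1679, 1557,
          1206, 192, 1025, 813, 971]
# Type II ranges (n2 = 30).
type_2 = [1472, 1315, 1984, 1900, 1646, 1709, 1780, 1571, 1681, 1453,
          1799, 1372, 1420, 1612, 1546, 1255, 1818, 1357, 1412, 1775,
          1850, 1417, 2057, 1468, 1694, 1776, 1456, 1489, 1618, 1544]

t = unequal_variance_t(type_1, type_2)
T_CRIT = 2.064  # t_{.025,24} as quoted from Appendix Table 3
reject = abs(t) > T_CRIT
print(f"t = {t:.3f}, reject H0: {reject}")
```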

2.6. Test for a Difference Between Proportions, $p_1 - p_2$

To  determine  whether  two  samples  come  from  binomial  populations 
having  the  same  proportion  defective,  see  Sec.  4.7.1  on  contingency  tables. 

2.7.  Test  for  Differences  Among  Several  Means 

Use  the  method  of  "analysis  of  variance"  as  described  in  Chapter  5. 

2.8.  Detection  of  a  Trend  in  a  Set  of  Means,  Standard 
Deviations  Unknown  but  Assumed  Equal 

In  addition  to  the  sections  below,  see  Chapter  6. 


2.8.1.  Assumptions 

A  random  sample  of  size  n  is  drawn  from  each  of  k  normal  populations 
with the same standard deviation. The common sample size n may be 1.

2.8.2. Method

A trend in means is some relationship among the means which can be
represented graphically by either a straight line or a curve. The ordinary
formula for variance among means in a $k$-sample problem is

$$s^2 = \frac{1}{k - 1}\sum_{i=1}^{k} (\bar{x}_i - \bar{x})^2$$

where $\bar{x}$ is the mean of the $k$ sample means and $\bar{x}_i$ is the mean
of the $i$th sample. Now $s^2$ will include the effect of a trend in means, if
a trend is present. An estimate of variance which reduces the trend effect is
$\tfrac{1}{2}\delta^2$, where

$$\delta^2 = \frac{1}{k - 1}\sum_{i=1}^{k-1} (\bar{x}_{i+1} - \bar{x}_i)^2 = \text{mean square successive difference}$$

To test whether a trend in means is present, we compute $\delta^2/s^2$ and
compare the result with the appropriate critical value of $\delta^2/s^2$ in
Appendix Table 14.

TABLE 2.4. Glucose Concentration of Five
Mixtures Tested Over a 35-Week Period

No. of weeks     Mean % glucose, $\bar{x}_i$     $\bar{x}_{i+1} - \bar{x}_i$
     4                  43.14                        -0.98
     8                  42.16                         2.84
    12                  45.00                        -1.00
    16                  44.00                         3.86
    20                  47.86                         0.29
    24                  48.15                         1.95
    27.5                50.10                        -1.05
    30                  49.05                         1.10
    35                  50.15

Three situations may occur.

1. If the data show no trend, we expect a $\delta^2/s^2$ near 2.

2. If the data follow some curve, we expect a $\delta^2/s^2$ less than 2.

3. If the data oscillate rapidly, we expect a $\delta^2/s^2$ greater than 2.
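
The ratio $\delta^2/s^2$ is simple to compute from a list of means. The sketch below applies it to the nine means of Table 2.4; the function name is illustrative, and the critical value must still be taken from Appendix Table 14, which the script does not reproduce.

```python
def successive_difference_ratio(means):
    """Ratio of the mean square successive difference to the variance among means."""
    k = len(means)
    grand_mean = sum(means) / k
    s2 = sum((m - grand_mean) ** 2 for m in means) / (k - 1)
    delta2 = sum((b - a) ** 2 for a, b in zip(means, means[1:])) / (k - 1)
    return delta2 / s2


# Mean % glucose at the nine test times of Table 2.4.
glucose_means = [43.14, 42.16, 45.00, 44.00, 47.86, 48.15, 50.10, 49.05, 50.15]
ratio = successive_difference_ratio(glucose_means)
print(f"delta^2 / s^2 = {ratio:.4f}")

# A rapidly oscillating series, for contrast, gives a ratio above 2.
oscillating = successive_difference_ratio([1, -1, 1, -1, 1, -1])
print(f"oscillating series: {oscillating:.2f}")
```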

Example. Five mixtures containing glucose are allowed to stand for
a total of 35 weeks. At approximately 4-week intervals, the % glucose
concentration is determined. The means (each being the mean of five
observations) are given in Table 2.4. We wish to test at the 5% significance
level whether the mean % glucose concentration increases with
time.

We compute

$$\frac{\delta^2}{s^2} = \frac{\sum (\bar{x}_{i+1} - \bar{x}_i)^2/(k - 1)}{\sum (\bar{x}_i - \bar{x})^2/(k - 1)} = \frac{\sum (\bar{x}_{i+1} - \bar{x}_i)^2}{\sum (\bar{x}_i - \bar{x})^2} = \frac{\sum (\bar{x}_{i+1} - \bar{x}_i)^2}{\sum \bar{x}_i^2 - (\sum \bar{x}_i)^2/k}$$

$$= \frac{31.1247}{75.8454} = 0.4104$$

In Appendix Table 14 for $k = 9$, the 95% critical value for $\delta^2/s^2$ is
1.0244. Since 0.4104 < 1.0244, $\delta^2/s^2$ is significant at the 5% level.
Thus, a trend in mean % glucose concentration as a function of time
does exist.

Chapter 3

TESTS  AND  CONFIDENCE  INTERVALS  FOR 
STANDARD  DEVIATIONS 


3.0.  Introduction 

In this chapter, methods are given for (1) testing, by examination of a
random sample, the hypothesis that the standard deviation $\sigma$ (or the
variance $\sigma^2$) of a normal population has a certain hypothetical value,
specified before the experiment; (2) calculating confidence limits for the
estimate of $\sigma$; and (3) comparing two or more standard deviations.

What  is  the  importance  of  the  population  standard  deviation?  It  is  a 
measure  of  dispersion;  in  other  words,  a  measure  of  lack  of  precision.  As 
such,  it  includes  both  lack  of  reproducibility  and  whatever  measurement 
error  is  present.  A  weapon  cannot  be  used  for  tasks  requiring  precision  if 
test-firing  shows  that  the  standard  deviation  in  distance  from  a  fixed  target 
is  large.  On  the  other  hand,  if  the  standard  deviation  is  small,  but  the 
average  distance  of  shots  from  the  target  line  is  large,  it  is  possible  that  a 
simple  aiming  correction  will  center  the  shots  on  the  target.  If  the  correction 
can  be  made  without  increasing  the  dispersion,  the  weapon  will  be  both 
accurate  and  precise. 

In  production,  a  sudden  increase  in  variability  may  indicate  the  appear¬ 
ance  of  a  production  fault,  such  as  the  maladjustment  of  a  machine  or  the 
dulling  of  a  cutter  blade. 

The  tests  presented  here  are  based  on  the  assumption  that  the  population 
is  normal.  This  restriction  can  often  be  relaxed,  especially  in  cases  involving 
large  samples.  All  samples  are  taken  to  be  random  samples.  (See  Chapter  4 
for  tests  of  normality  and  tests  of  randomness.) 

All the significance tests of Chapter 3 involve fixed sample sizes. However,
if it is practical to experiment on one item at a time without knowing
the total number that will be required, the most economical method is
provided by "sequential analysis." (See Sec. 8.4 and its references.)

3.1. Computation of s

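The sample variance, and best estimate of the population variance $\sigma^2$, is

$$s^2 = \frac{1}{n - 1}\sum_{i=1}^{n} (x_i - \bar{x})^2 = \frac{1}{n(n - 1)}\left[n\sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2\right]$$

The latter form is preferred for computation. The sum of the $x_i$'s and the
sum of the $x_i^2$'s can be computed simultaneously on most desk computers
by accumulated multiplication.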

If $k$ samples from populations having a common variance are available
for estimating the variance, the pooled estimate is

$$s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2 + \cdots + (n_k - 1)s_k^2}{n_1 + n_2 + \cdots + n_k - k}$$

where $(n_1 - 1)s_1^2$ is the sum of the squared deviations of observations in
the first sample from their mean, and so on. If the $k$ samples are from
populations that may have different means, $s_p^2$ should be used instead of
an over-all computed $s^2$, as the latter will overestimate the variance by
including the differences among the means. The pooled estimate $s_p$ may be
used to make significance tests and to find confidence intervals for the
common standard deviation of the $k$ populations with the number of
degrees of freedom $n_1 + n_2 + \cdots + n_k - k$. (An example of the use
of $s_p^2$ occurs in Sec. 2.5.3b.)
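
The computing form of the sum of squares and the pooled estimate can be sketched as follows, using the coded cutting-machine lengths of Table 2.2 as data. The function names are illustrative. One caution that applies to machine arithmetic rather than to the Manual's desk methods: when the mean is large compared with the spread, the computing form can lose precision in floating point, so coding the data first (as the Manual does) is a sensible habit.

```python
def sum_sq_dev(xs):
    """Sum of squared deviations from the mean, via sum(x^2) - (sum x)^2 / n."""
    n = len(xs)
    return sum(x * x for x in xs) - sum(xs) ** 2 / n


def sample_variance(xs):
    """s^2 with n - 1 degrees of freedom."""
    return sum_sq_dev(xs) / (len(xs) - 1)


def pooled_variance(*samples):
    """Pooled s_p^2 for k samples assumed to share a common variance."""
    total_ss = sum(sum_sq_dev(s) for s in samples)
    degrees_of_freedom = sum(len(s) for s in samples) - len(samples)
    return total_ss / degrees_of_freedom


# Coded lengths from Table 2.2.
y1 = [6, 7, 93, 3, 50, 60, 167, 51, 6, 101]
y2 = [173, 202, 136, 316, 150, 182, 75, 90, 231, 221]

ss1 = sum_sq_dev(y1)
ss2 = sum_sq_dev(y2)
sp2 = pooled_variance(y1, y2)
print(f"SS1 = {ss1:.1f}, SS2 = {ss2:.1f}, pooled variance = {sp2:.2f}")
```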

If the $x$ values are coded by letting $y_i = m x_i + b$, where $m$ and $b$ are
constants,

$$\bar{x} = \frac{1}{m}(\bar{y} - b) \quad \text{and} \quad s_x^2 = \frac{1}{m^2} s_y^2$$

Examples of the computations appear throughout this chapter; see, for
instance, Sec. 3.2.2.

If means and standard deviations must be obtained from data that have
been grouped together in intervals of length $h$, so that all observations lying
in one interval are considered to be at the midpoint of the interval, we may
employ "Sheppard's correction" to improve the approximation. This correction
calls for subtraction of $h^2/12$ from the $s^2$ computed using midpoints.
The correction is appropriate if the sample is taken from a distribution
that tapers to zero at the ends. (See Ref. 4.4, pp. 359-63.)

If $y$ is a linear function of several independent variables
$x_1, x_2, \ldots, x_k$,

$$y = a_1 x_1 + a_2 x_2 + \cdots + a_k x_k + b$$

then the variance of $y$ is

$$\sigma_y^2 = a_1^2 \sigma_{x_1}^2 + a_2^2 \sigma_{x_2}^2 + \cdots + a_k^2 \sigma_{x_k}^2$$

The standard deviation of $y$ may be found in terms of the standard deviations
of the $x$'s as

$$\sigma_y = \left(a_1^2 \sigma_{x_1}^2 + a_2^2 \sigma_{x_2}^2 + \cdots + a_k^2 \sigma_{x_k}^2\right)^{1/2}$$

Example. If $y = (x_1 + x_2 + \cdots + x_n)/n$, where the $x_i$'s form a
random sample from a population with standard deviation $\sigma_x$, then
$\sigma_y^2 = n(1/n)^2 \sigma_x^2 = \sigma_x^2/n$. Hence $\sigma_y = \sigma_x/\sqrt{n}$.


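Example. If $y = \bar{x}_1 - \bar{x}_2$, where $\bar{x}_1$ and $\bar{x}_2$ are
the means of independent random samples of size $n_1$ and $n_2$, then

$$\sigma_y^2 = \sigma_{\bar{x}_1}^2 + \sigma_{\bar{x}_2}^2 = \frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}$$
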

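If $y = F(x_1, x_2, \ldots, x_k)$, and the changes $\Delta x_i$ in the $x$'s are
small, the differential $dy$ approximates the change $\Delta y$ in $y$. Thus

$$\Delta y \approx dy = \frac{\partial F}{\partial x_1}\Delta x_1 + \frac{\partial F}{\partial x_2}\Delta x_2 + \cdots + \frac{\partial F}{\partial x_k}\Delta x_k$$

If the changes $\Delta x_i$ in the $x$'s are independent, the variance in
$\Delta y$ may be approximated by

$$\sigma_y^2 \approx \left(\frac{\partial F}{\partial x_1}\right)^2 \sigma_{x_1}^2 + \left(\frac{\partial F}{\partial x_2}\right)^2 \sigma_{x_2}^2 + \cdots + \left(\frac{\partial F}{\partial x_k}\right)^2 \sigma_{x_k}^2$$

with the partial derivatives evaluated at nominal values of
$x_1, x_2, \ldots, x_k$.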

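Example. The vacuum velocity of a rocket after burning can be
computed as

$$v = v_g \log_e \frac{m_b + m_p}{m_b}$$

The gas velocity $v_g$, the "burnt" rocket weight $m_b$, and the propellant
weight $m_p$ are assumed to have small independent variations. Find the
variance of the vacuum velocity $v$ in terms of the variances of $v_g$, $m_b$,
and $m_p$.

Taking the total differential, we have

$$\Delta v = \frac{\partial v}{\partial v_g}\Delta v_g + \frac{\partial v}{\partial m_b}\Delta m_b + \frac{\partial v}{\partial m_p}\Delta m_p$$

$$= \left(\log_e \frac{m_b + m_p}{m_b}\right)\Delta v_g + \left[\frac{-m_p v_g}{m_b (m_b + m_p)}\right]\Delta m_b + \left(\frac{v_g}{m_b + m_p}\right)\Delta m_p$$

and the variance of $v$ is approximately

$$\sigma_v^2 \approx \left(\log_e \frac{m_b + m_p}{m_b}\right)^2 \sigma_{v_g}^2 + \left[\frac{-m_p v_g}{m_b (m_b + m_p)}\right]^2 \sigma_{m_b}^2 + \left(\frac{v_g}{m_b + m_p}\right)^2 \sigma_{m_p}^2$$

If a new type of rocket is to have $v_g = 8,000$ feet per second, $m_b = 50$
pounds, and $m_p = 50$ pounds, we can compute

$$\sigma_v^2 \approx 0.480\,\sigma_{v_g}^2 + 6,400\,\sigma_{m_b}^2 + 6,400\,\sigma_{m_p}^2$$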
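
The coefficients in this example can be checked numerically. The sketch below evaluates the analytic partial derivatives at the nominal values and, as an independent check, approximates the same derivatives by central differences. The helper `numeric_partials` and its step size are illustrative choices, not part of the Manual.

```python
from math import log


def vacuum_velocity(v_g, m_b, m_p):
    """v = v_g * log_e((m_b + m_p) / m_b)."""
    return v_g * log((m_b + m_p) / m_b)


def analytic_partials(v_g, m_b, m_p):
    """Partial derivatives of v with respect to v_g, m_b, and m_p."""
    return (
        log((m_b + m_p) / m_b),
        -m_p * v_g / (m_b * (m_b + m_p)),
        v_g / (m_b + m_p),
    )


def numeric_partials(f, x, rel_step=1e-6):
    """Central-difference approximation to each partial derivative of f at x."""
    grads = []
    for i, xi in enumerate(x):
        h = abs(xi) * rel_step or rel_step
        upper, lower = list(x), list(x)
        upper[i] = xi + h
        lower[i] = xi - h
        grads.append((f(*upper) - f(*lower)) / (2 * h))
    return grads


nominal = (8000.0, 50.0, 50.0)  # v_g in ft/sec, m_b and m_p in pounds
partials = analytic_partials(*nominal)
coefficients = [p * p for p in partials]  # multipliers of the three variances
checked = numeric_partials(vacuum_velocity, nominal)
print("variance coefficients:", [round(c, 3) for c in coefficients])
```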


3.2. $\chi^2$ Test for $\sigma$ in a Normal Distribution


3.2.1.  Assumptions 

(1)  The  population  has  a  normal  distribution. 

(2)  The  sample  is  a  random  sample. 

3.2.2.  Test 

To test the null hypothesis that the normal population has standard
deviation $\sigma = \sigma_0$ at the significance level $\alpha$ on the basis of
a sample of size $n$, we compute the sum of the squared deviations from the
sample mean, $(n - 1)s^2$, and form the chi-square statistic

$$\chi^2 = \frac{(n - 1)s^2}{\sigma_0^2}$$

Appendix Table 4 gives values of the $\chi^2$ distribution for $f = n - 1$
degrees of freedom. If the computed $\chi^2$ is less than
$\chi^2_{1-\alpha/2,n-1}$ for $P = 1 - \alpha/2$, or greater than
$\chi^2_{\alpha/2,n-1}$ for $P = \alpha/2$, we reject the null hypothesis that
the standard deviation is $\sigma_0$, for if the standard deviation actually is
$\sigma_0$, such a large or such a small sample variance will occur with a
probability of only $\alpha$. (See Sec. 1.1.7 for a general discussion of
statistical tests.) The number $\chi^2_{\alpha/2,n-1}$ is the $\chi^2$ value
exceeded with probability $\alpha/2$ for $n - 1$ degrees of freedom. Similarly,
for $n - 1$ degrees of freedom, $\chi^2_{1-\alpha/2,n-1}$ is the $\chi^2$ value
exceeded with probability $1 - \alpha/2$. (See Sec. 8.4.2 for a
sequential test for $\sigma$.)

Example (Equal-Tails Test). The data in Table 3.1 represent the
burning times of 13 grains. Assuming that the distribution of burning
times is normal, test at the 5% level of significance whether the standard
deviation of the population is 0.01 (i.e., $\sigma = \sigma_0 = 0.01$).

TABLE 3.1. Actual and Coded Burning
Times of 13 Grains

Burning time, sec     Coded value
    ($x_i$)           ($y_i = 1,000 x_i - 500$)
     0.516                 16
     0.508                  8
     0.517                 17
     0.529                 29
     0.501                  1
     0.521                 21
     0.539                 39
     0.509                  9
     0.521                 21
     0.532                 32
     0.547                 47
     0.504                  4
     0.525                 25

We compute

$$\sum y_i = 269, \qquad \sum y_i^2 = 7,809$$

$$(n - 1)s_y^2 = 7,809 - \frac{(269)^2}{13} = 2,242.769$$

$$(n - 1)s_x^2 = 2,242.769 \times 10^{-6}$$

$$\chi^2 = \frac{(n - 1)s_x^2}{\sigma_0^2} = \frac{2,242.769 \times 10^{-6}}{10^{-4}} = 22.43$$

From Appendix Table 4 for 12 degrees of freedom, we read

$$\chi^2_{.975,12} = 4.404 \quad \text{and} \quad \chi^2_{.025,12} = 23.34$$

Since 22.43 lies inside this interval, we cannot reject the hypothesis that
$\sigma = 0.01$.

Example (One-Sided Test). If in the case of burning time we are
interested only in whether $\sigma$ exceeds 0.01, and are quite satisfied with
any smaller standard deviation, we compare $\chi^2 = 22.43$ with

$$\chi^2_{.05,12} = 21.03$$

Since 22.43 exceeds 21.03, we reject at the 5% level of significance the
hypothesis that $\sigma \le 0.01$, and conclude that $\sigma$ exceeds 0.01.
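
Both burning-time examples rest on the same statistic, which can be computed from the uncoded data as sketched below. The critical values are the tabled figures quoted in the examples, entered as constants; the function name is illustrative.

```python
def chi_square_statistic(xs, sigma_0):
    """(n - 1) s^2 / sigma_0^2 for a sample xs and hypothesized sigma_0."""
    n = len(xs)
    x_bar = sum(xs) / n
    return sum((x - x_bar) ** 2 for x in xs) / sigma_0**2


# Burning times in seconds from Table 3.1.
burning_times = [0.516, 0.508, 0.517, 0.529, 0.501, 0.521, 0.539,
                 0.509, 0.521, 0.532, 0.547, 0.504, 0.525]

chi2 = chi_square_statistic(burning_times, sigma_0=0.01)

# Tabled values for 12 degrees of freedom, as quoted from Appendix Table 4.
CHI2_975, CHI2_025, CHI2_05 = 4.404, 23.34, 21.03

reject_equal_tails = chi2 < CHI2_975 or chi2 > CHI2_025
reject_one_sided = chi2 > CHI2_05
print(f"chi-square = {chi2:.2f}")
print(f"equal-tails reject: {reject_equal_tails}, one-sided reject: {reject_one_sided}")
```

The two tests reach different conclusions on the same data because the one-sided test places the whole 5% risk in the upper tail.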

a. Determination of Sample Size. We can determine the necessary
sample size for a one-sided test of the null hypothesis that the standard
deviation $\sigma$ equals some hypothetical value $\sigma_0$ rather than
alternative values greater than $\sigma_0$ at the significance level $\alpha$.
(As defined in Sec. 1.1.8, the significance level $\alpha$ is the risk of
rejecting the hypothesis that the standard deviation is $\sigma_0$ when the
standard deviation actually is $\sigma_0$.) We want the risk to be not more
than $\beta$ of accepting the hypothesis that $\sigma$ is $\sigma_0$ when
actually $\sigma$ has the alternative value $\sigma_1 = \lambda\sigma_0$.
Appendix Chart VII gives OC curves for several sample sizes for a one-sided
test ($\lambda > 1$) conducted at the 5% level of significance. Compute

$$\lambda = \frac{\sigma_1}{\sigma_0}$$

and enter the chart with $\lambda$ and $\beta$. Use the sample size indicated
on the next lower curve, or interpolate between curves.

Example. We wish to make a one-sided test at the 5% level to
determine whether the standard deviation of length of grains made by
a new process is larger than the standard deviation obtained under the
former process. We will accept a risk of 0.1 of not detecting a standard
deviation half again as large as the old standard deviation. How large
a sample size should be used?

In this case $\lambda = 1.5\sigma_0/\sigma_0 = 1.5$, and $\beta = 0.1$. Appendix
Chart VII shows that a sample size of 28 is necessary. Using this sample size
assures us that we can accept the null hypothesis $\sigma = \sigma_0$, or the
alternative hypothesis $\sigma = \sigma_1 = 1.5\sigma_0$, with satisfactorily
small risks of being wrong in either case.

3.2.3.  Confidence  Intervals 

A set of $100(1 - \alpha)$% confidence limits for the standard deviation
$\sigma$ of a normal population may be computed from a sample of size $n$ as

$$\left[\frac{(n - 1)s^2}{\chi^2_{\alpha/2,n-1}}\right]^{1/2} \quad \text{and} \quad \left[\frac{(n - 1)s^2}{\chi^2_{1-\alpha/2,n-1}}\right]^{1/2}$$

where $(n - 1)s^2 = \sum_{i=1}^{n} (x_i - \bar{x})^2$. Unlike the confidence
limits for the mean (Sec. 2.1.3), which are symmetric about the estimate
$\bar{x}$, these limits are not symmetric about the estimate $s$.

Alternatively, Appendix Table 8 enables us to obtain 90, 95, or 99%
confidence limits for $\sigma$ from a sample of size $n$ by multiplying the
factors $b_1$ and $b_2$, given for $f = n - 1$ degrees of freedom, by the
estimate $s$ computed from the sample.

Example. A sample of 20 gage measurements of pressure has a
sample standard deviation of 11 pounds per square inch. What are 95%
confidence limits for the population standard deviation $\sigma$?

From Appendix Table 4, we find

$$\chi^2_{.025,19} = 32.85 \quad \text{and} \quad \chi^2_{.975,19} = 8.907$$

With $s = 11$, we have as 95% confidence limits for $\sigma$

$$\left(\frac{19}{32.85}\right)^{1/2}(11) = 8.4 \text{ psi} \quad \text{and} \quad \left(\frac{19}{8.907}\right)^{1/2}(11) = 16.1 \text{ psi}$$

Notice that the limits are not symmetric; that is, they are not equally
far from $s$.

The following method is preferable in practice. From Appendix
Table 8, for $f = n - 1 = 19$ the factors $b_1 = 0.760$ and $b_2 = 1.461$
yield the 95% confidence limits

(0.760)(11) = 8.4 psi and (1.461)(11) = 16.1 psi
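
The two routes to the confidence limits can be compared in a few lines. The sketch below uses the tabled $\chi^2$ values quoted in the example as inputs (it does not compute percentage points itself), and the function name is illustrative.

```python
from math import sqrt


def sigma_confidence_limits(s, n, chi2_upper, chi2_lower):
    """Confidence limits for sigma from s, n, and the two tabled chi-square values.

    chi2_upper is the value exceeded with probability alpha/2,
    chi2_lower the value exceeded with probability 1 - alpha/2.
    """
    sum_sq = (n - 1) * s**2
    return sqrt(sum_sq / chi2_upper), sqrt(sum_sq / chi2_lower)


s, n = 11.0, 20
lower, upper = sigma_confidence_limits(s, n, chi2_upper=32.85, chi2_lower=8.907)

# The implied multipliers of s correspond to the factors b1 and b2 of Appendix Table 8.
b1, b2 = lower / s, upper / s
print(f"limits: {lower:.1f} psi and {upper:.1f} psi")
print(f"implied factors: b1 = {b1:.3f}, b2 = {b2:.3f}")
```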

a. Determination of Sample Size. We can determine the sample size
necessary to estimate $\sigma$ within $p$% of its true value with confidence
coefficient $1 - \alpha$. Appendix Chart IX gives the desired sample size as a
function of $p$ and the confidence coefficient.

Example.  In  estimating  the  precision  of  a  new  weapon,  how  large 
a  sample  size  should  be  used  to  specify  the  standard  deviation  within 
30%  of  its  true  value  with  95%  confidence? 

From Appendix Chart IX, for $p = 30$% on the curve for confidence
coefficient 0.95, we find 21 degrees of freedom. Since for our test the
number of degrees of freedom is $n - 1$, the necessary sample size is 22.

3.3. F Test for $\sigma_1/\sigma_2$ for Normal Distributions

3.3.1.  Assumptions 

(1)  The  populations  have  normal  distributions. 

(2)  The  samples  are  random  samples  drawn  independently  from  the 
two  populations. 

3.3.2.  Test 

To test the null hypothesis that the ratio $\sigma_1^2/\sigma_2^2$ of the
variances of two normal populations is 1 (i.e., $\sigma_1 = \sigma_2$) at the
significance level $\alpha$, on the basis of a sample of size $n_1$ from
population 1 and an independent sample of size $n_2$ from population 2,
compute the statistic

$$F = \frac{s_1^2}{s_2^2}$$

with the larger $s^2$ in the numerator so that the computed $F$ is always
greater than 1. Enter Appendix Table 5 of the $F$ distribution with
$f_1 = n_1 - 1$ degrees of freedom for the numerator and $f_2 = n_2 - 1$
degrees of freedom for the denominator. Reject the hypothesis if $F$ exceeds
the tabled value $F_{\alpha/2}(n_1 - 1, n_2 - 1)$ for $P(F) = \alpha/2$, because
a ratio $F$ that large or larger can occur with probability only $\alpha$ if the
null hypothesis holds. For an equal-tails test, we force the computed $F$ to be
greater than 1 by the choice of numerator and denominator; hence, we use
$F > F_{\alpha/2}$ as the rejection region instead of the double region
$F > F_{\alpha/2}$ and $F < F_{1-\alpha/2}$.

For an equal-tails test of the null hypothesis
$\sigma_1^2/\sigma_2^2 = R \neq 1$, compute $F$ as the ratio of $s_1^2$ and
$R s_2^2$, using the larger quantity in the numerator.

A one-sided test is appropriate if we are interested only in whether
population 1, specified before observing the data, has greater variability than
population 2. We compute

$$F = \frac{s_1^2}{s_2^2}$$

where $s_1^2$ represents the estimated variance of population 1 (not necessarily
greater than $s_2^2$ as in the equal-tails test). If $F$ exceeds the tabled value
$F_\alpha(n_1 - 1, n_2 - 1)$, we reject the null hypothesis that
$\sigma_1 \le \sigma_2$ and decide that population 1 has more variability than
population 2. If the computed $F < 1$, we can immediately accept the null
hypothesis that $\sigma_1 \le \sigma_2$, since the tabled value of $F$ is always
greater than 1.

Figure  3.1  shows  the  probability  density  function  of  F  for  two  pairs  of 
numbers  of  degrees  of  freedom. 


FIG. 3.1. Frequency Distribution of $F = s_1^2/s_2^2$.

Example (Equal-Tails Test). To determine whether two types of rocket have
different dispersions and should, therefore, be recommended for different
uses, an F test is made on data already available. From a random sample of
61 rockets of one type, the computed $s^2$ is 1,346.89 mils$^2$. From a
random sample of 31 rockets of the other type, the computed $s^2$ is
2,237.29 mils$^2$. Is there a significant difference at the 5% level in the
standard deviations?


Since 2,237.29 exceeds 1,346.89, we label the sample of 31 rockets
Sample 1. To test the null hypothesis $\sigma_1 = \sigma_2$, we compute

$$F = \frac{2{,}237.29}{1{,}346.89} = 1.66$$

From Appendix Table 5, $F_{.025}(30, 60) = 1.82$. Since 1.66 is less than
1.82, we cannot reject the null hypothesis that the population standard
deviations are equal. We need not, as far as this experiment indicates,
differentiate between the two rockets with respect to their dispersion.


Example (One-Sided Test). A personnel department plans to distribute a
form to job applicants to aid in placement. Two forms, one 2 pages long and
one 3 pages long, involving grading systems, have been designed. A test is
made to determine whether the 3-page form has a larger standard deviation
of scores than the 2-page form. Here a wide spread of scores is preferred.

The 3- by 5-inch index cards of information filled out by each new
employee during one week's hiring are shuffled, and 16 cards are selected
at random. The 16 employees thus selected are requested to fill out the
3-page form (Sample 1). Their cards are returned to the deck and all the
cards are reshuffled. Again 16 cards are drawn at random, and the employees
thus selected are asked to fill out the 2-page form (Sample 2). Notice that
the two samples are independently drawn. Since the first 16 cards drawn are
replaced, their selection has no effect on the selection of the second set
of 16. If an employee's card is selected in both drawings, he fills out
both forms.


After scoring, the 2-page form gives $s_2^2 = 12.3$, and the 3-page form
gives $s_1^2 = 40.2$. Is the standard deviation for the 3-page form
significantly greater at the 5% level than that of the 2-page form?

To test the null hypothesis $\sigma_1 \le \sigma_2$, we compute

$$F = \frac{40.2}{12.3} = 3.27$$

and compare with $F_{.05}(15, 15) = 2.40$. Since $F > F_{.05}$, we reject
the null hypothesis, and conclude that the standard deviation of scores for
the 3-page form is greater than that for the 2-page form. If the 2-page
form had shown greater standard deviation (as could often happen under the
null hypothesis), we would have kept to our one-sided test by merely
stating that no significant departure had been shown.
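
The arithmetic of the two worked examples above can be checked with a short
Python sketch. It is an illustration only, not part of the Manual: the
function names `f_statistic` and `reject_null` are hypothetical, and the
critical value is still read from Appendix Table 5 and supplied by hand.

```python
def f_statistic(s2_a, n_a, s2_b, n_b, equal_tails=True):
    """Return (F, numerator df, denominator df) for the variance-ratio test.

    Equal-tails test: the larger variance is moved to the numerator, so
    the computed F is at least 1.
    One-sided test: sample "a" must be the sample from population 1,
    chosen before the data are seen, and no swap is made.
    """
    if equal_tails and s2_b > s2_a:
        s2_a, n_a, s2_b, n_b = s2_b, n_b, s2_a, n_a
    return s2_a / s2_b, n_a - 1, n_b - 1


def reject_null(f_value, tabled_value):
    """Reject when the computed F exceeds the tabled critical value."""
    return f_value > tabled_value


# Rocket example: equal-tails test at the 5% level, F.025(30, 60) = 1.82.
f_rocket, df_num, df_den = f_statistic(1346.89, 61, 2237.29, 31)
print(round(f_rocket, 2), df_num, df_den, reject_null(f_rocket, 1.82))

# Personnel-form example: one-sided test, F.05(15, 15) = 2.40.
f_form, _, _ = f_statistic(40.2, 16, 12.3, 16, equal_tails=False)
print(round(f_form, 2), reject_null(f_form, 2.40))
```

Passing the tabled value as an argument keeps the sketch free of any
assumption about how the F distribution is evaluated; the decision rule is
the same comparison made by hand in the examples.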


a. Determination of Sample Size. Suppose we are using the one-sided test
described in this section to determine at the significance level $\alpha$
whether $\sigma_1$ exceeds $\sigma_2$, with an assigned risk $\beta$ of
accepting the null hypothesis that $\sigma_1/\sigma_2 = 1$ when actually
$\sigma_1/\sigma_2 = \lambda > 1$. If samples of equal size
$n_1 = n_2 = n$ are to be taken from the two populations, the necessary
sample size $n$ can be found in Appendix Chart VIII opposite $\lambda$ and
$\beta$ for the special case $\alpha = 0.05$.


Example. It is suspected that the aiming error of a pilot at high
altitudes will be greater (1) without the use of a pressurized suit, than
(2) with a pressurized suit. A test is to be made by having a pilot fire
$n_1 = n_2 = n$ shots under each condition. We will accept a risk of 0.1 of
deciding that the standard deviations $\sigma_1$ and $\sigma_2$ of the
shots are equal, when actually $\sigma_1 = 2\sigma_2$. What sample size
should be used if the test is to be made at the 5% level of significance?

From Appendix Chart VIII for $\lambda = 2$ and $\beta = 0.1$, we find
$n = 20$.


3.3.3.  Confidence  Intervals 


A set of $100(1 - \alpha)\%$ confidence limits for the ratio
$\sigma_1/\sigma_2$ may be computed. The lower limit is

$$\frac{s_1}{s_2} \sqrt{\frac{1}{F_{\alpha/2}(n_1 - 1, n_2 - 1)}}$$

and the upper limit is

$$\frac{s_1}{s_2} \sqrt{\frac{1}{F_{1-\alpha/2}(n_1 - 1, n_2 - 1)}}
  = \frac{s_1}{s_2} \sqrt{F_{\alpha/2}(n_2 - 1, n_1 - 1)}$$

The latter form of the upper limit should be used for computation.

a. Determination of Sample Size. We can determine the sample size
necessary for estimating the ratio $\sigma_1/\sigma_2$ within $p\%$ of its
true value with confidence coefficient 0.90, 0.95, or 0.99 from the number
of degrees of freedom $f_1 = f_2 = f$, shown in Appendix Chart X.

Example. How large a sample should be drawn from each of two populations
in order to estimate the ratio $\sigma_1/\sigma_2$ within 20% of its true
value with confidence coefficient 0.95?

From Appendix Chart X for $p = 20\%$, we find the number of degrees of
freedom $f = 99$. Therefore the sample size $n_1 = n_2 = n = 99 + 1 = 100$.

NOTE: From Appendix Charts IX and X it follows that a total of slightly
more than four times as many observations are needed to compare two unknown
standard deviations with any given accuracy as are needed to compare one
unknown standard deviation with a known value with the same accuracy.
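
The confidence limits of Sec. 3.3.3 can be sketched in Python as follows.
This is a hedged illustration: the function name `ratio_confidence_limits`
is hypothetical, and both F values are assumed to be read from a table by
the user. The usage lines apply the formula to the rocket data of Sec.
3.3.2 purely as an illustration; 1.82 is the value of $F_{.025}(30, 60)$
quoted there, while 1.94 for $F_{.025}(60, 30)$ is taken from a standard F
table and should be checked against Appendix Table 5.

```python
import math


def ratio_confidence_limits(s1_sq, s2_sq, f_12, f_21):
    """Confidence limits for sigma_1 / sigma_2.

    f_12 is the tabled F_{alpha/2}(n1 - 1, n2 - 1), used for the lower limit.
    f_21 is the tabled F_{alpha/2}(n2 - 1, n1 - 1), used for the upper limit
    (the form recommended for computation).
    """
    ratio = math.sqrt(s1_sq / s2_sq)      # s1 / s2
    lower = ratio / math.sqrt(f_12)
    upper = ratio * math.sqrt(f_21)
    return lower, upper


# Sample 1: 31 rockets, s^2 = 2,237.29; Sample 2: 61 rockets, s^2 = 1,346.89.
lower, upper = ratio_confidence_limits(2237.29, 1346.89, f_12=1.82, f_21=1.94)
print(round(lower, 2), round(upper, 2))
```

With these inputs the interval contains 1, which is consistent with the
equal-tails test of the same data not rejecting $\sigma_1 = \sigma_2$.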

3.4. M Test for Homogeneity of Variances

3.4.1.  Assumptions 

(1)  The  populations  have  normal  distributions. 

(2)  The  samples  are  random  samples  drawn  independently  from  the 
respective populations.

3.4.2.  Test 

To test the null hypothesis that $k$ populations, from which $k$ samples of
sizes $n_1, n_2, \ldots, n_k$ have been drawn, have the same variances, we
compute

$$M = \phi \log_e \left( \frac{\sum_{i=1}^{k} f_i s_i^2}{\phi} \right)
      - \sum_{i=1}^{k} f_i \log_e s_i^2$$

where

$f_i = n_i - 1$, the number of degrees of freedom in computing $s_i^2$ for
the $i$th sample

$\phi = \sum_{i=1}^{k} f_i$

$s_i^2$ = estimate of the variance of the $i$th population

$\log_e = 2.3026 \log_{10}$

If we wish to use common logarithms (base 10), we substitute "$\log_{10}$"
in the formula wherever "$\log_e$" occurs and multiply the final result by
2.3026.

Appendix Table 6 gives values of $M$ at the 5% significance level. If the
computed value of $M$ exceeds all the values in the rows for the given $k$,
it is significant at the 5% level, and we reject the null hypothesis that
the variances are all equal. If $M$ is less than all the values in the
rows, it is not significant at the 5% level, and we cannot reject the null
hypothesis. If $M$ falls within the range of tabled values for the given
$k$, a more refined technique is necessary. We compute

$$c_1 = \sum_{i=1}^{k} \frac{1}{f_i} - \frac{1}{\phi}$$

To be significant, $M$ must exceed the appropriate value under $c_1$
(interpolating between values of $c_1$ if necessary), opposite the given
$k$. For the computed $c_1$ the appropriate critical value lies between the
value in row (a) and the value in row (b).

The (a) value is a maximum, obtained when all the values $f_i$ are equal.
The (b) value is a minimum, obtained when some of the values $f_i = 1$ and
the rest are infinite. In case neither condition (a) nor condition (b)
applies and the computed $M$ lies between the (a) and (b) values (as will
rarely happen), this borderline case may be attacked by methods of
Ref. 3.5. Ref. 3.5 gives both 1 and 5% points.

Refer  to  Chapter  5  for  a  test  of  homogeneity  of  means,  which  can  be 
applied  if  the  M  test  shows  that  the  variances  are  homogeneous. 

Example. Table 3.2 gives statistics on 10 samples of 6 from the
production of 10 different machines. Is the variability the same for all
machines?

TABLE 3.2. Data on Samples From 10 Machines

Machine     $s_i^2$     $\log_e s_i^2$
   1         95.46         4.559          11.7
   2         26.16         3.264           6.2
   3          9.20         2.219           1.0
   4         14.70         2.688           2.5
   5         14.96         2.705           3.2
   6         14.16         2.650           3.2
   7         25.36         3.233           6.2
   8         24.16         3.185           8.8
   9          3.06         1.118           6.3
  10         26.70         3.285           4.5

Total       253.92        28.906          53.6

With $k = 10$, $n_i = n = 6$, and $f_i = f = 5$, we compute

$$\phi = \sum_{i=1}^{10} f_i = 50$$

$$\frac{\sum f_i s_i^2}{\phi} = \frac{5(253.92)}{50} = 25.392$$

$$\log_e \left( \frac{\sum f_i s_i^2}{\phi} \right) = 3.234$$

$$M = (50)(3.234) - (5)(28.906) = 17.17$$

In Table 6 opposite $k = 10$, we see that the values range from 16.92 to
19.89. Since this range includes our $M$ value, we go on to compute

$$c_1 = \sum_{i=1}^{10} \frac{1}{5} - \frac{1}{50} = 2 - 0.02 = 1.98$$

The value $M = 17.17$ is less than the (a) values for both $c_1 = 1.5$ and
$c_1 = 2.0$; hence no interpolation is necessary for $c_1 = 1.98$. We
conclude that no difference among the variances has been shown at the 5%
significance level. (Note that here the (a) value is the pertinent one,
since all the $f_i$ are equal.)
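
The computation of $M$ and $c_1$ in this example can be reproduced with a
few lines of Python. This is a sketch under stated assumptions: the
function name `m_statistic` is hypothetical, and the comparison with
Appendix Table 6 is still made by hand.

```python
import math


def m_statistic(variances, dfs):
    """Return (M, c1) for the M test of homogeneity of variances.

    variances[i] is s_i^2 and dfs[i] is f_i = n_i - 1 for the i-th sample.
    """
    phi = sum(dfs)
    pooled = sum(f * s2 for f, s2 in zip(dfs, variances)) / phi
    m = phi * math.log(pooled) - sum(
        f * math.log(s2) for f, s2 in zip(dfs, variances)
    )
    c1 = sum(1.0 / f for f in dfs) - 1.0 / phi
    return m, c1


# Column s_i^2 of Table 3.2; each sample of 6 gives f_i = 5.
s2 = [95.46, 26.16, 9.20, 14.70, 14.96, 14.16, 25.36, 24.16, 3.06, 26.70]
m_value, c1_value = m_statistic(s2, [5] * 10)
print(round(m_value, 2), round(c1_value, 2))
```

Because the sketch uses full-precision logarithms, its $M$ differs from the
hand value of 17.17 in the second decimal place; the conclusion drawn from
Table 6 is unaffected.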
Chapter 4


TESTS  OF  DISTRIBUTIONS  AS  A  WHOLE 
AND  ALLIED  PROBLEMS 

4.0.  Introduction 

In Chapter 2, tests were given to determine whether a sample came from a
population having mean $\mu$. In Chapter 3, there were tests designed to
show whether a sample came from a normal population having standard
deviation $\sigma$. In this chapter, tests are given to determine whether a
sample comes from a population having a distribution of any specified form.
Such information is often valuable. For instance, several of the tests of
Chapter 2 and Chapter 3 required a normal distribution. With the criteria
of this chapter, we can test whether a normal-distribution requirement is
satisfied.

Several  miscellaneous  topics  are  treated,  such  as  randomness,  sensitivity 
testing,  independence,  gross  errors,  and  tolerance  intervals. 

4.1.  Run  Test  for  Randomness 

In all the tests given in this Manual, it is required that the sample be
drawn at random from its population. (See Sec. 1.1.1.) Sometimes it is
difficult, without examining the sample itself, to be sure that the
selection does not involve some hidden trend. The run test for randomness,
which can be applied as a one-sided or an equal-tails test, is illustrated
by the following example.

Example. Coded measurements of the weights of 20 grains are recorded in
Table 4.1 in the order in which they were obtained. Test at the 5% level of
significance whether the 20 weights represent a random sample from which to
estimate the average weight for the population of all such grains.

TABLE 4.1. Coded Measurements of Weights of 20 Grains
(order of observation runs down each column)

    12    23    29    28
    14    25    10    28
    22    27    25    30
    21    25    30    30
    22    28    30    28

We find that the median of the 20 readings is 26 (that is, the average of
the tenth and eleventh when the readings are arranged in the order of
size). We mark with minus signs readings less than 26 and with plus signs
those greater than 26, as in Table 4.2. Here we are interested not just in
the numbers of these signs (as in the sign test, Sec. 2.5.2a), but in how
they are interspersed. If the sample is random, we should expect low values
and high values (values below 26 and values above 26) to be fairly well
scattered; for instance, we should not expect all the low values to precede
all the high values. On the other hand, we should not expect them to
alternate. If we count an unbroken sequence of plus signs or minus signs as
one "run," the data show that the number of runs $v$ is 6.

TABLE 4.2. Coded Weights of 20 Grains Separated in Runs

Run no.     1      2      3      4      5      6

           12-    27+    25-    28+    10-    30+
           14-                  29+    25-    30+
           22-                                28+
           21-                                28+
           22-                                30+
           23-                                30+
           25-                                28+

Appendix Table 10 is a table of critical values for runs. The table is
entered with quantities $n_1$ and $n_2$, the numbers of objects of the two
kinds in the sequence. In this case $n_1 = n_2 = 10$, since there are 10
items below and 10 items above the median. Thus we find $v_{.975} = 6$ and
$v_{.025} = 16$. A number of runs greater than 6 but less than 16 is
consistent with the null hypothesis of randomness at the 5% level of
significance. A value of 6 or below, or of 16 or above, indicates that the
hypothesis should be rejected at this level, since if the hypothesis is
true, a number of runs either so small or so large would happen with
probability not more than 0.05. (See Sec. 1.1.7 for a general discussion of
tests of hypotheses.)
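
The counting of runs about the median can be sketched in Python as below.
This is an illustration, not the Manual's procedure verbatim: the function
name `count_runs` is hypothetical, and dropping readings equal to the
median is an assumption of the sketch (the grain data have no such ties).
The critical values 6 and 16 are those quoted above from Appendix Table 10.

```python
import statistics


def count_runs(values):
    """Return (runs, n_below, n_above) about the sample median.

    Readings equal to the median are dropped (an assumption of this sketch).
    """
    median = statistics.median(values)
    above = [v > median for v in values if v != median]
    if not above:
        return 0, 0, 0
    # A new run starts wherever the sign changes between neighbors.
    changes = sum(1 for a, b in zip(above, above[1:]) if a != b)
    return 1 + changes, above.count(False), above.count(True)


# Table 4.1 in order of observation.
weights = [12, 14, 22, 21, 22, 23, 25, 27, 25, 28,
           29, 10, 25, 30, 30, 28, 28, 30, 30, 28]
runs, n_below, n_above = count_runs(weights)
reject = runs <= 6 or runs >= 16     # equal-tails test at the 5% level
print(runs, n_below, n_above, reject)
```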

Since here the conclusion is that the sample is not random, we should look
for possible causes. If the weight readings were taken on the 20 grains at
the time of production and in the order of their extrusion, we should look
for a factor (such as the warming-up of a machine) that would cause an
increase, with time, in the weight of the grains. If rejection had been on
the grounds that there were too many runs ($v \ge 16$), we should look for
a factor in production that tends to cause alternate heavy and light
grains.

Another  application  of  the  run  test  appears  in  Sec.  4.7.2.  For  a  test  of 
trend  in  means,  see  Sec.  2.8.  For  a  test  of  the  randomness  of  occurrence  of 
defects  in  production,  see  the  example  in  Sec.  4.2. 

4.2. The $\chi^2$ Test for Goodness of Fit

Suppose the sample values from an experiment fall into $r$ categories. To
decide at the significance level $\alpha$ whether the data constitute a
sample from a population with distribution function $f(x)$, we first
compute the expected number of observations that would fall in each
category as predicted by $f(x)$. The grouping should be arranged so that
this theoretical frequency is at least five for each category. If
necessary, several of the original groups may be combined in order to
assure this. To compare the observed frequencies, $n_i$ for the $i$th
category, with the expected (theoretical) frequencies, $e_i$, compute

$$\chi^2 = \sum_{i=1}^{r} \frac{(n_i - e_i)^2}{e_i}$$

If the calculated value of $\chi^2$ exceeds the value
$\chi^2_{\alpha,\, r-1-g}$ for $f = r - 1 - g$ degrees of freedom in
Appendix Table 4, reject at the significance level $\alpha$ the null
hypothesis that the distribution function is $f(x)$. An explanation of the
subtracted $g$ follows.

If, before the experiment, the distribution function $f(x)$ to be tested is
completely specified (for instance, in an experiment to determine whether a
coin is "true" by comparing the numbers of heads and tails tossed with
those predicted by the binomial distribution with proportion $p = 1/2$),
then $g = 0$. If, before the experiment, we can specify the type or form of
the distribution $f(x)$ which we want to test, but we cannot specify $f(x)$
completely (for instance, in an experiment to see whether aiming errors
have a normal distribution, or in an experiment to see whether burning time
has a normal distribution with standard deviation 0.01 second), then $g$
stands for the number of quantities necessary to complete the
specification. These quantities must be obtained as estimates from the
experimental data themselves. For instance, to decide whether aiming errors
have a normal distribution, we compute $\bar{x}$ from the sample as an
estimate of $\mu$ and $s$ as an estimate of $\sigma$, and since we use
$g = 2$, the test shows whether the observations are consistent with the
hypothesis that the data come from a normal distribution. To decide whether
burning time has a normal distribution with standard deviation 0.01 second,
we need to estimate $\mu$ only; here $g = 1$.

Example. In Sec. 1.2.3, plant records are given showing the frequency of
occurrence of $x$ defects in a production day; these data are repeated in
Table 4.3. Test at the 5% level of significance whether defects are
occurring randomly.

TABLE 4.3. Fitting a Poisson Distribution

No. of defects     Actual no. of            Theoretical no. of
produced in        occurrences of $x$       occurrences of $x$
a day, $x$         defects in a day,        defects in a day,
                   $n_i$                    $e_i$

0                  102                      95.3
1                   59                      71.1
2                   31                      26.5
3                    8 \                     6.6 \
4                    0  |  9                 1.2  |  8.0
5                    1  |                    0.2  |
6 or more            0 /                     0.0 /

If defects are occurring randomly, the relative frequencies should follow
the Poisson distribution

$$P(x) = \frac{m^x e^{-m}}{x!} \qquad (x = 0, 1, 2, \ldots)$$



Since  we  do  not  know  which  specific  Poisson  distribution  to  expect 
(namely,  which  value  of  m  to  use),  we  must  use  for  m  the  mean  of  the 
sample,  0.74627.  We  multiply  the  total  number  of  days,  201,  by  the 
theoretical  relative  frequency  in  each  category,  P(x),  to  obtain  the 
theoretical  number  in  each  category. 

In order to have the theoretical frequency in each category at least 5, we
group the last four figures together. We then compute

$$\chi^2 = \frac{(6.7)^2}{95.3} + \frac{(-12.1)^2}{71.1}
         + \frac{(4.5)^2}{26.5} + \frac{(1.0)^2}{8.0} = 3.42$$

Taking $g = 1$, because we estimated $m$ from the sample, we find from
Appendix Table 4 that $\chi^2_{.05,\, 4-1-1} = 5.991$. Since 3.42 does not
exceed this value, we have no reason to reject the hypothesis that defects
are occurring randomly.
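
The fitting and the $\chi^2$ computation of this example can be sketched in
Python as follows. The function name `poisson_chi_square` and its
`tail_from` parameter are hypothetical; the critical value 5.991 is the one
quoted above from Appendix Table 4. The sketch pools every count of three
or more defects into one category, as in the text.

```python
import math


def poisson_chi_square(observed, tail_from):
    """Chi-square for a Poisson fit with the mean estimated from the data.

    observed[x] is the number of days on which x defects occurred.
    Categories with x >= tail_from are pooled into a single group.
    Returns (estimated mean, expected frequencies, chi-square).
    """
    total = sum(observed)
    m = sum(x * n for x, n in enumerate(observed)) / total
    expected = [
        total * math.exp(-m) * m ** x / math.factorial(x)
        for x in range(tail_from)
    ]
    expected.append(total - sum(expected))        # pooled upper tail
    grouped = list(observed[:tail_from]) + [sum(observed[tail_from:])]
    chi2 = sum((n - e) ** 2 / e for n, e in zip(grouped, expected))
    return m, expected, chi2


m_hat, e_values, chi2 = poisson_chi_square([102, 59, 31, 8, 0, 1, 0], 3)
dof = len(e_values) - 1 - 1          # r - 1 - g with g = 1
print(round(m_hat, 5), [round(e, 1) for e in e_values], round(chi2, 2), dof)
print(chi2 > 5.991)                  # reject only if True
```

Because the expected frequencies are not rounded to one decimal before
squaring, the statistic from the sketch differs slightly from the hand
value of 3.42; the conclusion is the same.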


4.3.  Testing  the  Fit  of  a  Normal  Distribution 

To test the hypothesis that a random sample comes from a population having
a normal distribution, we may fit a normal curve to the data and then test
to see whether the hypothesis is justified. The testing may be done by
means of a $\chi^2$ test for goodness of fit, in which case the mean and
standard deviation for the fitted normal curve should be estimated from the
grouped sample data. (See Chernoff and Lehmann, Annals of Mathematical
Statistics, vol. 25, pp. 579-586 for the error in applying the $\chi^2$
test when ungrouped data are used in fitting.)

An obvious but rough check on normality can be made by plotting the fitted
normal curve to the same scale as the histogram of the grouped data. (See
Fig. 4.1 below.) A more convenient graphical approach calls for the use of
normal-probability graph paper (Keuffel & Esser Co., No. 358-23 and
359-23). If the population is normal, a plot of the sample cumulative
percentage frequencies should approximate a straight line. However, in both
of these graphical approaches there will be difficulty in judging by eye
how much departure from the ideal pattern we must expect. Even the plot for
a sample of 1,000 drawn from a population known to be normal often exhibits
substantial irregularity.

Besides the approach through goodness of fit of the best possible normal
curve, there are other tests such as the $b_1$, $b_2$, and $a$ tests;
however, many statisticians now consider the $\chi^2$ test superior.

Example. A grouped sample of 300 observations of lateral deflection
yields $\bar{x} = 0.88$ mil and $s = 52.53$ mils as estimates of the
population mean and standard deviation. The computation was made from the
grouped data of Table 4.4, using Sheppard's correction. Test at the 5%
level of significance whether these data came from a normal distribution.

The observations were to the nearest 0.1 mil; therefore, intervals
beginning at -150.05, -130.05, etc., were used to avoid ambiguity in
grouping. Each group end-point is shown in column 2 as a number of $s$
units away from $\bar{x}$, so that Appendix Table 1 (the cumulative normal
distribution table) can be used to find column 3. Differencing column 3
gives the relative frequency for each group as predicted by the normal
distribution. This frequency, shown in column 4, is multiplied by 300 to
give the theoretical frequencies $e_i$ of column 5. Since the theoretical
frequencies are very small in the first and last few groups, all
deflections less than -110.05 are combined into one new group and all
deflections greater than 109.95 into another. Because the net number of
groups is 13, and sample statistics, $\bar{x}$ and $s$, were used to
estimate the corresponding population measures, $\mu$ and $\sigma$, the
number of degrees of freedom for $\chi^2$ is $13 - 1 - 2 = 10$. From
Appendix Table 4, we find that $\chi^2_{.05,\, 10} = 18.31$, and summing
the values in column 7, we find that $\chi^2 = 4.89$. We conclude that at
the 5% level of significance the sample distribution is consistent with the
hypothesis that the parent distribution is normal.

A frequency histogram for the grouped data of Table 4.4 is given in
Fig. 4.1, along with the normal curve with the same mean and standard
deviation.

TABLE 4.4. Grouped Data Used in Computation for $\chi^2$ Test of Normality

FIG. 4.1. Frequencies of Lateral Deflections, Showing the Approximating
Normal Curve With $\bar{x} = 0.88$ Mil and $s = 52.53$ Mils for Sample
Size 300. (Precise intervals begin at -150.05, -130.05, etc.) Horizontal
axis: lateral deflection, mils.

4.3.1. Transformations To Obtain Normality

Many physical situations produce data that are normally distributed;
others, data that follow some other known distribution; and still others,
data that can be transformed to normal data. For example, when the reaction
rate in a chemistry experiment is proportional to the concentration of
reacting substances the distribution of rates is not likely to be normal,
but the distribution of their logarithms may be theoretically normal. The
most common transformations are $y = \log x$ (frequently used in
sensitivity testing), and $y = \sqrt{x}$. An easy way to decide whether one
of these transformations is likely to produce normality is to make use of
special graph papers that are commercially available. For instance, if
$y = \log x$ produces normality, the sample cumulative distribution curve
plotted on log-probability graph paper will approximate a straight line.

4.4.  Confidence  Band  for  Cumulative  Distribution 

To construct, on the basis of a random sample of size $n$, a
$100(1 - \alpha)\%$ confidence band for the cumulative distribution of the
population, draw the cumulative-percentage histogram for the sample. Find
in Appendix Table 11 a quantity $d_\alpha$ corresponding to the sample size
$n$ and the confidence coefficient $1 - \alpha$. Draw two "staircase" lines
parallel to the sample histogram, one $100 d_\alpha$ percentage units above
the histogram, the other $100 d_\alpha$ percentage units below. In the long
run, confidence bands so constructed for random samples from populations
will, in $100(1 - \alpha)\%$ of the cases, completely contain the
population cumulative-distribution curve.

Example. In Sec. 2.5.3b, data were given on the lengths in mm of parts
produced by two machines. The lengths for Machine 2, $x_{2i}$, arranged in
order of size, are:

    93.75  93.90  94.36  94.50  94.73
    94.82  95.02  95.21  95.31  96.16

Construct the sample cumulative histogram and a 95% confidence band.

The cumulative histogram of these data (Fig. 4.2) shows that 0% of the
sample had lengths less than 93.75; 10%, lengths less than 93.90; 20%,
lengths less than 94.36; . . . ; 90%, lengths less than 96.16; and that all
10 lengths were 96.16 or less. Reference to Appendix Table 11 gives for
$n = 10$ and $\alpha = 0.05$ the value $d_\alpha = 0.41$. Thus we construct
the boundaries of the confidence band 41 percentage units above and 41
percentage units below the histogram.
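
The construction of the band for this example can be sketched in Python as
below. The function name `confidence_band` is hypothetical, the value 0.41
is the $d_\alpha$ quoted above from Appendix Table 11, and clipping the band
to the range 0 to 1 is an assumption of the sketch (a cumulative proportion
cannot lie outside that range).

```python
def confidence_band(sample, d_alpha):
    """Return rows (x, lower, cumulative, upper) for the staircase band.

    `cumulative` is the proportion of the sample less than or equal to x,
    that is, the height of the cumulative histogram just after its step
    at x. The band is clipped to [0, 1].
    """
    ordered = sorted(sample)
    n = len(ordered)
    rows = []
    for i, x in enumerate(ordered, start=1):
        cumulative = i / n
        lower = max(0.0, cumulative - d_alpha)
        upper = min(1.0, cumulative + d_alpha)
        rows.append((x, lower, cumulative, upper))
    return rows


lengths = [93.75, 93.90, 94.36, 94.50, 94.73,
           94.82, 95.02, 95.21, 95.31, 96.16]
band = confidence_band(lengths, d_alpha=0.41)
for x, lower, cumulative, upper in band:
    print(f"{x:6.2f}  {lower:4.2f}  {cumulative:4.2f}  {upper:4.2f}")
```

A theoretical cumulative curve can then be checked against the band by
evaluating it between successive steps and comparing with the `lower` and
`upper` values.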

If we superimpose on a graph like Fig. 4.2 the cumulative distribution
curve $F(x)$ for some theoretical distribution $f(x)$ and if the curve
leaves the confidence band at any point, we can reject at the significance
level $\alpha$ the null hypothesis that the population was distributed
according to $f(x)$. Thus, the confidence band gives us a possible test
method, though the method of Sec. 4.2 is more powerful.

If we wish to test whether a distribution is of some general form, such as
normal, but not to test specific values of constants (parameters),
estimates of the latter are made from the sample. The test may still be
made, but it lacks discrimination, in the sense that it will not reject the
distribution unless its form is very different from that of the null
hypothesis. It may be used correctly to reject such a distribution at a
significance level no greater than $\alpha$, but it will reject much too
seldom.

FIG. 4.2. Cumulative Histogram With 95% Confidence Band.

4.5. Sensitivity Testing

"Sensitivity testing" is testing in which an increasing percentage of items
fail, explode, or die as the severity of the test is increased. In such
testing we cannot measure the precise severity of test (that is, the
precise magnitude of the variable concerned) which would barely result in
failure, but can only observe whether an applied severity results in
failure, or does not. For instance, in a test to discover the minimum range
to target at which a particular rocket will function as intended, we might
find a given range to be either too short for proper functioning or long
enough for proper functioning (but probably not the minimum length). The
rocket expended in the first trial cannot be used again. The same
difficulty arises in tests for determining the lethal dosage of a poison
(Ref. 4.2). Any given dosage either kills or does not kill the experimental
animal. The animal, having been affected or killed by the first trial,
cannot be used again. In the determination of critical height (the minimum
height from which a weight must be dropped on a charge to explode it), a
charge is either demolished or changed in its characteristics as a result
of the trial. In all these cases we see that each item has its own critical
level of severity, so that in a whole population of items there is a
probability distribution of critical levels.

Sensitivity tests for finding the mean, standard deviation, or percentiles
in such cases may be analyzed by "staircase" methods (see Sec. 4.5.1 for a
particular staircase method and Ref. 4.3 for others) or by "probit
analysis" (see Ref. 4.2 and 4.6). Both methods depend upon an assumption of
normality in the quantity being studied. A rough but quick method for the
critical-height test, for instance, is to plot the percentage of items
exploding below each given height on the commercially available
arithmetic-probability graph paper and fit a straight line by eye. This
fitting provides a check of normality also, and may suggest a
transformation of the independent variable which will give approximate
normality if the data do not lie near a straight line. Probit analysis does
the fitting more precisely and gives a measure of the precision, but
staircase methods are more efficient when they are applicable. The
staircase methods are recommended when the effect on each item is known
immediately or soon after each application, and when the independent
variable can be readily adjusted. For instance, staircase methods are
appropriate for the charge critical-height test described above. Probit
analysis is recommended when it is not practical to measure one item at a
time; for example, when the ultimate effects of given amounts of radiation
are to be measured on a group of small animals, such as a pen of hamsters
or mice. If it is inconvenient to adjust the independent variable, as in
the case of tests of rockets at different temperatures, probit analysis is
appropriate. For detailed application of probit analysis and the method of
Sec. 4.5.1 to fuze safety and reliability, see Ref. 4.11.

4.5.1. Up-and-Down Method

A valuable and frequently used method of sensitivity testing of the
staircase type is the "up-and-down," or "Bruceton," method (Ref. 4.5, pp.
279-87). Normal distribution and ready adjustment of the independent
variable to any prescribed level are required. The method is especially
applicable to estimating the mean μ (50% point) of the distribution because
it concentrates the observations in the neighborhood of the mean; it can
also be used for estimating the standard deviation σ and other percentage
points, though less accurately.
Example. For estimating the distribution of the minimum functioning
range of a type of rocket against a given type of target, the target would
first be placed at a range y' (the best guess for μ, the range at which 50%
of the rockets would function, though it is not necessary that y' be a good
guess). A rocket is fired at the target. If it functions, a second rocket is
fired at a target at range y' - d, where d is chosen beforehand as about
0.5σ if μ is to be estimated with maximum precision, and as about 1.5σ
if the 10 or 90% point is to be estimated with maximum precision. If the
first rocket does not function, the second rocket is fired at a target at
range y' + d. Subsequent firings are at similarly altered ranges; i.e., the
target or the launcher is moved closer by d after functioning, farther by
the same amount after nonfunctioning, until all the rockets of the
prescribed sample are expended. The results can be recorded as shown in
Fig. 4.3, where x denotes a functioning rocket and o a nonfunctioning
rocket. The notation x and o could be reversed or replaced; in some other
type of test the probability of explosion might decrease as distance
increases, for example.


[Figure 4.3: chart of range (vertical axis, 100 to 500) against successive
firings, each firing marked x (functioning) or o (nonfunctioning).]

FIG. 4.3. Type of Chart Used in Up-and-Down Method of Sensitivity Testing.
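
The stepping rule of the example can be sketched in a few lines of Python.
This is only an illustration under stated assumptions: the function name
`run_up_and_down`, the `functions_at` callback, and the idealized response
used in the usage line (every rocket functions at a range of 300 or more)
are hypothetical and do not come from the Manual.

```python
from typing import Callable, List, Tuple


def run_up_and_down(
    start: float,
    step: float,
    n_items: int,
    functions_at: Callable[[float], bool],
) -> List[Tuple[float, str]]:
    """Apply the up-and-down rule of the rocket example.

    After a functioning ("x") the next range is one step closer;
    after a nonfunctioning ("o") it is one step farther.
    """
    level = start
    record = []
    for _ in range(n_items):
        functioned = functions_at(level)
        record.append((level, "x" if functioned else "o"))
        level = level - step if functioned else level + step
    return record


# Hypothetical, deterministic response used only to exercise the rule:
# every rocket functions at a range of 300 or more.
trials = run_up_and_down(start=400, step=100, n_items=6,
                         functions_at=lambda r: r >= 300)
```

In a real test the callback would be replaced by the observed outcome of
each firing, and the record would be plotted in the manner of Fig. 4.3.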


The analysis involves only the symbol occurring less frequently. Since
there are 13 o's and 12 x's, the latter are used. Let y_0 be the smallest
range (lowest level) of the symbol occurring less frequently; let
y_i = y_0 + id; and let n_i be the number of these symbols at the level
y_i. Then we can summarize the data and analysis as in Table 4.5, and make
the following estimates in terms of the column sums N, A, and B.
TABLE 4.5. Example of Data Analysis for Up-and-Down
Method of Sensitivity Testing

Range     i     n_i       i n_i     i^2 n_i
500       3     1         3         9
400       2     2         4         8
300       1     5         5         5
200       0     4         0         0
Total           N = 12    A = 12    B = 22

Estimate of μ:

$$\bar{y} = y_0 + d\left(\frac{A}{N} \pm \frac{1}{2}\right)$$

(+ if based on symbol at smallest value, here o;
- if based on symbol at largest value, here x)

Estimate of σ:

$$s = 1.620\,d\left(\frac{NB - A^2}{N^2} + 0.029\right)$$

(valid only if (NB - A^2)/N^2 > 0.3)

Estimate of 100p% point μ + z_p σ:

$$\bar{y} + z_p s$$

(z_p is the z for F(z) = p in Appendix Table 2)

Estimate of standard deviation of ȳ:

$$s_{\bar{y}} = \frac{6s + d}{7\sqrt{N}}$$

(valid only if d < 3σ)

Estimate of standard deviation of s:

$$s_s = \frac{1.1s + 0.3s^2/d}{\sqrt{N}}$$

(valid only if d < 2σ)

Estimate of standard deviation of ȳ + z_p s:

$$s_{\bar{y} + z_p s} = \sqrt{s_{\bar{y}}^2 + z_p^2 s_s^2}$$


Approximate 100(1 - α)% confidence limits for any parameter are
obtained by taking its estimate and adding and subtracting t_{α/2, N-1}
times the estimate of its standard deviation. Thus, applying the formulas
given above to the data of Table 4.5, we have:

$$\bar{y} = 200 + 100\left(\frac{12}{12} - \frac{1}{2}\right) = 250$$

$$s = 162\left(\frac{120}{144} + 0.029\right) = 140$$

$$\text{10\% point} = 250 + (-1.28)(140) = 71$$

$$s_{\bar{y}} = \frac{6(140) + 100}{7(3.46)} = 38.8$$

$$s_s = \frac{1.1(140) + 0.3(196)}{3.46} = 61.5$$

$$s_{\bar{y} - 1.28s} = \sqrt{(38.8)^2 + (1.28)^2 (61.5)^2} = 87.8$$

Approximate 95% confidence limits for the 10% point are

$$71 \pm (2.201)(87.8) = 71 \pm 193 \quad\text{or}\quad -122 \text{ and } 264$$

In  this  example,  the  distribution  is  certainly  confined  to  positive  values, 
so  -122  can  be  replaced  by  0. 
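
The hand calculation above can be checked with a short Python sketch. The
function name and the dictionary keys are illustrative assumptions; the
formulas are those listed after Table 4.5, and the counts are taken from
that table. Because the text rounds s to 140 and the square root of 12 to
3.46 before continuing, the unrounded results differ slightly in the last
digit or two.

```python
import math


def up_and_down_estimates(y0, d, counts, sign, z_p):
    """Estimates for the up-and-down method from level counts.

    counts[i] is n_i, the number of the less frequent symbol at level
    y0 + i*d. sign is +1 or -1, chosen as described for the estimate
    of the mean. z_p is the normal deviate of the percentage point.
    """
    N = sum(counts)
    A = sum(i * n for i, n in enumerate(counts))
    B = sum(i * i * n for i, n in enumerate(counts))
    spread = (N * B - A * A) / N**2
    if spread <= 0.3:
        raise ValueError("the formula for s requires (NB - A^2)/N^2 > 0.3")

    mean = y0 + d * (A / N + sign * 0.5)
    s = 1.620 * d * (spread + 0.029)
    sd_mean = (6 * s + d) / (7 * math.sqrt(N))
    sd_s = (1.1 * s + 0.3 * s * s / d) / math.sqrt(N)
    point = mean + z_p * s
    sd_point = math.sqrt(sd_mean**2 + z_p**2 * sd_s**2)
    return {"N": N, "A": A, "B": B, "mean": mean, "s": s,
            "sd_mean": sd_mean, "sd_s": sd_s,
            "point": point, "sd_point": sd_point}


# Table 4.5: x's at ranges 200, 300, 400, 500 (i = 0, 1, 2, 3).
# sign = -1 because the analysis is based on x, the symbol at the largest value.
est = up_and_down_estimates(y0=200, d=100, counts=[4, 5, 2, 1],
                            sign=-1, z_p=-1.28)
for key, value in est.items():
    print(key, round(value, 1))
```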

The general formulas shown above can also be used to determine
sample size. In the present example, since the parameters are seen to
be but poorly estimated by this sample of size 25, a larger sample size
would be desirable. If an estimate of σ can be made from previous
experience or design aims before the experiment, this estimate should
be substituted for s in the formula shown above for s_ȳ or s_{ȳ + z_p s},
in order to determine the sample size required for a prescribed precision.
The total sample size is at least twice the N of the formulas given above.
Thus, to determine the 10% point within ±50 with 95% confidence,
(t_{.025, N-1}) s_{ȳ - 1.28s} must be not greater than 50, so that
s_{ȳ - 1.28s} must be about 25 or its square about 625. If, from previous
experience, σ is thought to be about 100, and d is consequently taken as
1.5σ = 150, then


$$625 = \frac{1}{N}\left[\left(\frac{6(100) + 150}{7}\right)^2
      + (1.28)^2\left(110 + \frac{3{,}000}{150}\right)^2\right]
      = \frac{39{,}400}{N}$$

$$N = 65$$
Therefore,  the  required  number  of  rockets  is  about  130.  In  this  way  the
cost  of  the  test  required  for  the  prescribed  precision  may  be  found.  If 
this  cost  is  prohibitive,  it  becomes  necessary  to  re-examine  the  importance 
attached  to  knowing  the  desired  parameter  with  the  prescribed  precision, 
and  to  rebalance  the  costs  and  the  risks  involved. 

4.6.  Test  for  Independence  in  Contingency  Tables 

Table 4.6 shows a sample of size n classified according to two
characteristics (I and II). The number of sample members in the cell in the
ith row and the jth column is n_ij. Such an array is called a contingency
table.


TABLE 4.6. Contingency Table for Two
Characteristics of a Sample

Characteristic              Characteristic I
      II             1        2      . . .     s        Total
      1            n_11     n_12     . . .    n_1s      n_1.
      2            n_21     n_22     . . .    n_2s      n_2.
    . . .          . . .    . . .    . . .    . . .     . . .
      r            n_r1     n_r2     . . .    n_rs      n_r.
    Total          n_.1     n_.2     . . .    n_.s      n


If the two characteristics are independent of each other, the expected
number of sample members in any cell is calculated by simple proportion
from the marginal totals to be n_i. n_.j / n. We use the χ² test to test the
null hypothesis that the two characteristics are independent by comparing

$$\chi^2 = \sum_{i=1}^{r}\sum_{j=1}^{s}
  \frac{(n_{ij} - n_{i.}n_{.j}/n)^2}{n_{i.}n_{.j}/n}
  = n\left(\sum_{i=1}^{r}\sum_{j=1}^{s}\frac{n_{ij}^2}{n_{i.}n_{.j}} - 1\right)$$

with the value in Appendix Table 4, rejecting the hypothesis at the
significance level α if χ² > χ²_{α,(r-1)(s-1)}. The number of degrees of
freedom, f, is (r - 1)(s - 1). As in Sec. 4.2, the application of the χ²
test is reliable if every expected frequency is at least 5.
In  the  contingency  table,  the  headings  of  the  rows  and  columns  need
not  be  numerical  groupings.  The  following  combinations,  for  instance,  are 
possible: 

Characteristic I, degree of damage: no damage, damaged slightly,
completely demolished.

Characteristic  II,  age  of  structure:  old,  new. 

Characteristic  I,  estimate  of  applicant’s  ability:  excellent,  good,  fair,  poor. 
Characteristic  II,  person  making  estimate:  Appleby,  Badger,  Christofer. 

Example. An experiment on 75 rockets yields data on the characteristics
of lateral deflection and range as shown in Table 4.7. Test at the
5% level of significance the hypothesis that these two characteristics are
independent.


TABLE 4.7. Contingency Table for Experiment
With 75 Rockets

                         Lateral deflection, mils
Range, yd           -250 to -51   -50 to 49   50 to 199   Total
0 to 1,199               5            9           7         21
1,200 to 1,799           7            3           9         19
1,800 to 2,699           8           21           6         35
Total                   20           33          22         75


We note that the minimum expected frequency is at least 5 and
compute

$$\chi^2 = 75\left[\frac{5^2}{(21)(20)} + \frac{9^2}{(21)(33)}
  + \frac{7^2}{(21)(22)} + \frac{7^2}{(19)(20)} + \frac{3^2}{(19)(33)}
  + \frac{9^2}{(19)(22)} + \frac{8^2}{(35)(20)} + \frac{21^2}{(35)(33)}
  + \frac{6^2}{(35)(22)} - 1\right]$$

$$= 75(0.13955) = 10.466$$
When the form

$$\chi^2 = n\left(\sum_{i}\sum_{j}\frac{n_{ij}^2}{n_{i.}n_{.j}} - 1\right)$$

is used, we must carry extra decimal places to obtain an accurate value,
because of the order of operations (a small difference between relatively
large numbers, multiplied by a large number).

From Appendix Table 4, we find that χ²_{.05,4} = 9.488. Since 10.466
exceeds this value, we reject the hypothesis that lateral deflection and
range are independent characteristics.
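
As a check on the arithmetic, the statistic can be computed both from its
definition and from the shortcut form. The following Python sketch is an
illustration only; the function name `chi_square_independence` is an
assumption, and the critical value must still be read from Appendix Table 4.

```python
from typing import List, Tuple


def chi_square_independence(table: List[List[float]]) -> Tuple[float, float, int]:
    """Return (direct form, shortcut form, degrees of freedom)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    direct = 0.0      # sum of (observed - expected)^2 / expected
    ratio_sum = 0.0   # sum of n_ij^2 / (n_i. * n_.j)
    for row, r in zip(table, row_totals):
        for observed, c in zip(row, col_totals):
            expected = r * c / n
            direct += (observed - expected) ** 2 / expected
            ratio_sum += observed ** 2 / (r * c)
    shortcut = n * (ratio_sum - 1)

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return direct, shortcut, dof


# Table 4.7: rows are range classes, columns are lateral-deflection classes.
rockets = [[5, 9, 7], [7, 3, 9], [8, 21, 6]]
direct, shortcut, dof = chi_square_independence(rockets)
print(round(direct, 3), round(shortcut, 3), dof)
```

In double-precision arithmetic the two forms agree closely; the warning
about extra decimal places matters mainly for hand or low-precision work.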

The χ² test is an approximation which improves as the cell frequencies
increase. In the case of 2-by-2 tables (i.e., contingency tables for which
r = s = 2), the approximation can be improved by applying Yates's
continuity correction. If we have the entries shown in Table 4.8, then on
the null hypothesis that the two characteristics are independent, the
expected frequency (with the given marginal totals) corresponding to the
observed frequency a is (a + c)(a + b)/n. Applying Yates's correction, we
replace a by a + 1/2 if (a + c)(a + b)/n > a, and by a - 1/2 if the reverse
inequality is true; then the other frequencies b, c, and d must be changed
by 1/2 so as to preserve the same marginal totals. (Actually, it makes no
difference whether we start with a, b, c, or d.) The correction thus
decreases the difference between each pair of observed and expected
frequencies by 1/2. If each observed frequency is so close to the expected
frequency that the correction reverses the algebraic sign of the
difference, then the agreement is as good as possible, the null hypothesis
is immediately accepted, and no detailed calculation is necessary to test
it. Otherwise Yates's correction should always be made in 2-by-2 tables;
this is especially important for small expected frequencies. If small
expected frequencies, say less than 5, occur in a 2-by-2 table, it may be
desirable to apply an exact test. (See Ref. 4.7, 4.8, 4.9, and 4.10.
Table 38 of Ref. 4.9 gives the exact probability of each 2-by-2 table with
row (or column) sums up to 15; Ref. 4.8 extends this table to row (or
column) sums up to 20. See Sec. 4.7.1 for an example of the use of Yates's
correction.)


TABLE 4.8. Conventional Notation for a 2-by-2
Contingency Table

Characteristic        Characteristic I
      II              1          2          Total
      1               a          b          a + b
      2               c          d          c + d
    Total           a + c      b + d          n

For a 2-by-2 table, after the application of Yates's correction, the formula
simplifies to

$$\chi^2 = \frac{n(a'd' - b'c')^2}{(a + b)(a + c)(c + d)(b + d)}$$

where the primes indicate that Yates's correction has been applied.

4.7.  Consistency  of  Several  Samples 

The χ² test of Sec. 4.7.1 may be used on data in the form of frequencies
to determine whether several samples are "consistent," that is, drawn from
populations having the same distribution. For data not in the form of
frequencies, the run test of Sec. 4.7.2 is applicable when only two samples
are involved.

4.7.1.  The  χ²  Test

Let s different samples be drawn and the degree of a certain characteristic
be observed for each one, the frequency with which the ith degree of
the characteristic (i = 1, 2, ..., r) occurs in the jth sample being
recorded in a table of the same form as Table 4.6.

To test, at the significance level α, the null hypothesis that the s samples
were drawn from populations having the same distribution, proceed with
the χ² test exactly as in Sec. 4.6, even though the statistical model is now
different; that is, the column totals are not random but are the specified
sample sizes.

Example. The results of testing fuzes at -40 and 130°F are shown
in Table 4.9. Is there a significant difference at the 5% level in the
proportion of failures at the two temperatures?

We apply Yates's continuity correction described in Sec. 4.6. Here the
expected frequency of failures at 130°F is (59 X 15)/132 = 6.705 > 6.
TABLE 4.9. Results of Experiment With 132 Fuzes

Effect of            Fuzes tested
test            At -40°F     At 130°F     Total
Successes          64           53         117
Failures            9            6          15
Total              73           59         132


Therefore, we replace 6 by 6.5, and adjust the other frequencies to
preserve the marginal totals. The adjusted data are shown in Table 4.10.
If the expected frequency had been 6.4, for example, we would not have
replaced 6 by 6.5; we would have accepted immediately the hypothesis
of consistency, since 6 is different from 6.4 by less than 1/2.


TABLE 4.10. Table Obtained by Applying Yates's
Correction to Table 4.9

Effect of            Fuzes tested
test            At -40°F     At 130°F     Total
Successes         64.5         52.5        117
Failures           8.5          6.5         15
Total              73           59         132


Using the simplified formula for a 2-by-2 table, we have

$$\chi^2 = \frac{n(a'd' - b'c')^2}{(a + b)(a + c)(c + d)(b + d)}
  = \frac{132\,[(64.5)(6.5) - (52.5)(8.5)]^2}{(117)(73)(15)(59)} = 0.01273$$

Since the tabled value χ²_{.05,1} = 3.841 is greater than 0.01273, we cannot
reject the null hypothesis that the proportion defective is the same at the
two temperatures.
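
The correction and the simplified formula can be sketched in Python as
follows. The function name `yates_chi_square` is an assumption made for
illustration. Returning 0.0 when the observed and expected frequencies
differ by no more than 1/2 is also an assumption of this sketch; the text
simply accepts the null hypothesis in that case without computing anything.

```python
def yates_chi_square(a: float, b: float, c: float, d: float) -> float:
    """Chi-square for a 2-by-2 table (notation of Table 4.8) with Yates's correction."""
    n = a + b + c + d
    expected_a = (a + b) * (a + c) / n
    diff = a - expected_a

    # Assumption of this sketch: when the correction would reverse (or
    # cancel) the sign of the difference, report 0.0 to signal that the
    # agreement is as good as possible.
    if abs(diff) <= 0.5:
        return 0.0

    # Move every cell 1/2 toward its expected value; the marginal totals
    # are unchanged because a and d move one way, b and c the other.
    shift = 0.5 if diff < 0 else -0.5
    a1, b1, c1, d1 = a + shift, b - shift, c - shift, d + shift
    return n * (a1 * d1 - b1 * c1) ** 2 / ((a + b) * (a + c) * (c + d) * (b + d))


# Table 4.9: successes and failures at -40 and 130 degrees F.
chi2_fuzes = yates_chi_square(64, 53, 9, 6)
print(round(chi2_fuzes, 5))
```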


4.7.2.  Run  Test  for  Comparing  Two  Samples 

The run test described in Sec. 4.1 can be used also to determine whether
two random samples, say x's and y's, were drawn from populations having
the same distribution. The form of the distribution need not be specified
or even known.

Arrange all the readings, x's and y's together, in order of size. Count v,
the number of runs in x's and y's (instead of +'s and -'s as in Sec. 4.1).
Reject the null hypothesis of consistency if v is less than or equal to the
value v_.05 of Appendix Table 10. (This gives a test at the 5% level of
significance.)

Example. In Sec. 2.5.3b, two samples of ten were given whose
respective populations were assumed to have normal distributions with
the same standard deviation and with means 3.00 mm apart. Test at the
5% significance level whether the difference is 3.00 mm.

We subtract 3.00 mm from the readings of Sample 1 and list all 20
readings in order of size, as shown in Table 4.11. The number v of runs
is 6. For n_1 = n_2 = 10, Appendix Table 10 gives v_.05 = 6. Thus, we
reject the null hypothesis. Note that the test says nothing about whether
the distributions are normal, but merely considers whether they are the
same.


TABLE 4.11. Length in mm of
Parts From Two Machines

93.75        95.07 a
93.90        95.21
94.36        95.31
94.50        95.50 a
94.73        95.51 a
94.82        95.60 a
95.02        95.93 a
95.03 a      96.01 a
95.06 a      96.16
95.06 a      96.67 a

a  Readings from Sample 1, with 3.00 mm subtracted.
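
The run count for Table 4.11 can be reproduced with a short Python sketch.
The function name `count_runs` is an assumption, and so is the handling of
ties between the two samples (an x is placed before an equal y); no such
tie occurs in this example.

```python
from itertools import groupby
from typing import Sequence


def count_runs(sample_x: Sequence[float], sample_y: Sequence[float]) -> int:
    """Number of runs of x's and y's when both samples are merged in order of size."""
    labelled = sorted([(v, "x") for v in sample_x] + [(v, "y") for v in sample_y])
    labels = [label for _, label in labelled]
    # Each group of consecutive identical labels is one run.
    return sum(1 for _ in groupby(labels))


# Sample 1 readings with 3.00 mm already subtracted (marked "a" in Table 4.11).
sample_1 = [95.03, 95.06, 95.06, 95.07, 95.50, 95.51, 95.60, 95.93, 96.01, 96.67]
# Sample 2 readings (unmarked in Table 4.11).
sample_2 = [93.75, 93.90, 94.36, 94.50, 94.73, 94.82, 95.02, 95.21, 95.31, 96.16]

v = count_runs(sample_1, sample_2)
print(v)
```

The resulting v is then compared with the tabled critical value from
Appendix Table 10, exactly as in the example.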


4.8.  Gross  Errors

Often  an  experimenter  wants  to  discard  from  his  sample  a  reading  that 
appears  too  large  or  too  small,  feeling  that  it  will  ruin  his  results.  When 
is  he  justified  in  doing  so?  There  is  a  simple  answer  in  physical  terms:
He  is  justified  whenever  the  reading  in  question  did  not  come  from  the
population  he  intended  to  sample  but  from  some  other  population.  For
example,  if  he  learns  that  the  power  supply  was  erratic  during  the  period
from  0840  to  0845,  he  is  justified  in  throwing  out  readings  known  to  have 
been  taken  during  that  five  minutes,  under  the  supposition  that  they  did 
not  come  from  the  population  with  the  intended  power  supply. 

Whenever  possible,  such  a  physical  criterion  as  this  should  be  used  when 
data  are  to  be  discarded.  The  experimenter  who  discards  a  reading  just 
because  it  looks  too  large  compared  with  the  others  may  be  in  error,  for 
"outside"  readings  are  often  correctly  drawn  from  the  right  population  and 
should  influence  the  results,  especially  in  small  samples.  For  instance,  it  can 
be  shown  that  in  the  long  run  one  out  of  every  10  samples  of  size  3  from 
the  same  normal  population  will  have  one  of  the  readings  at  least  16  times 
as  far  from  the  middle  one  as  the  other. 

If the experimenter wants a statistical test to help satisfy himself that a
reading is too extreme to have come from the right population and if he is
dealing with a normal population, Appendix Table 16 may be used as
follows: Arrange the sample in order of size, x_1, x_2, ..., x_n, starting
with the smallest or the largest, depending on which value is to be
considered for rejection. According to the sample size n, compute the ratio
shown in Appendix Table 16. If it exceeds the critical value given under
α = .05, reject at the 5% level of significance the null hypothesis that
x_1 is from the same normal population as the other sample members. In this
case, omit x_1 from further calculations and use the values
x_2, x_3, ..., x_{n-1}, x_n.

Example. Burning times for six firings, from a normal distribution,
were recorded as follows:

0.505  0.511  0.519 

0.478  0.357  0.506 

Since  the  reading  0.357  is  much  smaller  than  the  other  readings,  should 
it  be  discarded  ?  Suppose  the  experimenter  suspects  that  the  reading  was 
subject  to  an  observer’s  error,  an  unusual  condition  in  firing,  or  a  faulty 
gage,  but  wishes  to  apply  a  statistical  test  to  help  satisfy  himself  that  his 
explanation  is  correct. 
For the six readings given above

$$r_{10} = \frac{x_2 - x_1}{x_6 - x_1}
         = \frac{0.478 - 0.357}{0.519 - 0.357} = 0.747$$

Since 0.747 exceeds the tabled value 0.628, listed under α = .05, the
probability is less than 5% that six readings from the same normal
population would include one reading so remote from the rest.

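
The ratio used in this example can be computed with a few lines of Python.
This sketch covers only the ratio r_10 that the example uses for n = 6;
Appendix Table 16 prescribes the ratio according to sample size, so other
values of n may call for a different ratio. The function name `dixon_r10`
and its `suspect` parameter are assumptions made for illustration.

```python
from typing import Sequence


def dixon_r10(readings: Sequence[float], suspect: str = "low") -> float:
    """Ratio (x_2 - x_1) / (x_n - x_1), with x_1 the suspected reading.

    suspect="low" orders the sample from the smallest reading upward;
    suspect="high" orders it from the largest reading downward.
    """
    ordered = sorted(readings, reverse=(suspect == "high"))
    return (ordered[1] - ordered[0]) / (ordered[-1] - ordered[0])


burning_times = [0.505, 0.511, 0.519, 0.478, 0.357, 0.506]
r10 = dixon_r10(burning_times, suspect="low")
print(round(r10, 3))
```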


4.9.  Tolerance  Intervals 

To construct, on the basis of a sample of size n from a normal population,
a pair of symmetric tolerance limits (Sec. 1.1.11) for 100P% of the
population with confidence coefficient 1 - α, obtain a coefficient K from
Appendix Table 17. The desired limits are then x̄ ± Ks, where x̄ and s are
computed from the sample. If many samples of size n are drawn and the
limits x̄ ± Ks computed in each case, then 100(1 - α)% of the time the
tolerance limits so computed will enclose at least 100P% of the population
from which the sample was drawn.

For more material on tolerance intervals, see Sec. 1.1.11; for more
extensive tables of K, see Ref. 4.10.

Example. A sample of 28 has mean 10.02 and standard deviation
s = 0.13. Find tolerance limits, having confidence coefficient 0.95, for
90% of the population.

For n = 28, α = 0.05, and P = 0.90, Appendix Table 17 gives K =
2.164. We compute

x̄ ± Ks = 10.02 ± (2.164)(0.13)

       = 10.02 ± 0.28

The tolerance limits are 9.74 and 10.30. If tolerance limits x̄ ± 2.164s
are constructed for many samples of size 28, in the long run 95% of such
pairs will enclose at least 90% of the normal population sampled.

A one-sided upper tolerance limit U can be constructed so that with
confidence coefficient 1 - α at least 100P% of the normal population will be
less than U (Ref. 4.10). We have

$$U = \bar{x} + ks$$

where

$$k = \frac{K_{1-P} + \sqrt{K_{1-P}^2 - ab}}{a}$$

$$a = 1 - \frac{K_\alpha^2}{2(n - 1)}$$

$$b = K_{1-P}^2 - \frac{K_\alpha^2}{n}$$

with K_ε defined so that the area under the standardized normal curve to
the right of K_ε is equal to ε as in Fig. 4.4. Similarly, a one-sided lower
tolerance limit is given by L = x̄ - ks.


Example. Find a one-sided tolerance limit for the preceding example.

We read opposite F(z) = 1 - ε in Appendix Table 2 the desired
K_ε, and compute

$$K_{1-P} = K_{.10} = 1.282$$

$$K_\alpha = K_{.05} = 1.645$$

$$n = 28$$

$$a = 1 - \frac{(1.645)^2}{2(27)} = 0.94989$$

$$b = (1.282)^2 - \frac{(1.645)^2}{28} = 1.5469$$

$$k = \frac{1.282 + \sqrt{(1.282)^2 - (0.94989)(1.5469)}}{0.94989} = 1.789$$

$$U = \bar{x} + ks = 10.02 + (1.789)(0.13) = 10.25$$

Thus, we may be 95% confident that at least 90% of the population
lies below U = 10.25.
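
The factor k can also be computed directly, without table look-up, as in
the following Python sketch. The function name is an assumption, and the
normal deviates are taken from the standard library's `NormalDist` rather
than from Appendix Table 2, so the result can differ from the hand value in
the third decimal place.

```python
import math
from statistics import NormalDist


def one_sided_tolerance_factor(n: int, P: float, alpha: float) -> float:
    """Factor k such that x-bar + k*s is an upper tolerance limit.

    P is the proportion of the population to be covered and 1 - alpha
    the confidence coefficient.
    """
    std_normal = NormalDist()
    k_p = std_normal.inv_cdf(P)          # K_{1-P}: upper-tail area 1 - P
    k_a = std_normal.inv_cdf(1 - alpha)  # K_alpha: upper-tail area alpha
    a = 1 - k_a**2 / (2 * (n - 1))
    b = k_p**2 - k_a**2 / n
    return (k_p + math.sqrt(k_p**2 - a * b)) / a


k = one_sided_tolerance_factor(n=28, P=0.90, alpha=0.05)
U = 10.02 + k * 0.13
print(round(k, 3), round(U, 2))
```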
Chapter  5


PLANNING  OF  EXPERIMENTS  AND
ANALYSIS  OF  VARIANCE


5.0.  Introduction 

An  experiment  is  generally  conducted  (1)  to  determine  whether  some 
conjectured  effect  exists,  and  (2)  if  the  effect  does  exist,  to  determine  its 
size.  If  the  experimenter  is  interested  only  in  the  effect  of  one  factor  (or 
variable)  on  one  other  variable,  the  design  of  his  experiment  may  be
simple.  But  if  he  is  concerned  with  more  than  one  factor,  as  is  frequently 
the  case,  his  experiment  or  experiments  will  be  more  complex  and  may  be 
conducted  in  various  ways.  The  classical  method  is  to  restrict  attention  to 
one  variable  at  a  time,  the  other  factors  being  held  fixed;  however,  this 
method  also  restricts  conclusions.  When  several  factors  are  of  interest, 
more  reliable  and  more  general  conclusions  can  be  reached  by  introducing 
further  levels  of  these  factors  in  the  initial  design. 

Allowing  more  than  one  factor  to  vary  in  a  single  integrated  design 
also  permits  determination  of  the  interaction  of  factors;  for  instance,  we 
may  learn  that  increasing  a  constituent  of  propellant  powder  increases  the 
burning  rate  more  at  one  powder  temperature  than  at  another. 

Aside  from  the  factors  that  we  want  to  investigate,  there  are  background 
conditions  which  may  affect  the  results  of  the  experiment.  Some  of  them 
may  be  taken  into  account  explicitly  in  the  design.  The  influence  of  the 
others  should  be  minimized  by  scheduling  the  experiments  with  all  the 
desired  combinations  of  levels  and  factors  in  a  random  order  that  has 
been  determined,  for  instance,  by  consulting  a  table  of  random  numbers.

There  may  be  variations  in  background  conditions  that  are  unknown 
to  the  experimenter.  To  be  able  to  conclude  validly  that  a  postulated  effect 
exists,  he  should  plan  the  experiment  so  that  a  control  item  will  be  sub¬ 
jected  to  the  same  background  conditions  (except  for  random  fluctuations) 
as  each  experimental  item.  For  example,  it  cannot  be  concluded  that  a 
new  type  of  rocket  has  a  smaller  dispersion  than  a  previous  type  unless  the
two  are  fired  under  the  same  conditions;  for  instance,  one  rocket  of  the 
previous  type  might  be  fired  as  a  control  with  each  new  rocket. 

Measurements  are  not  precisely  reproducible;  therefore,  to  an  extent 
depending  on  the  size  of  the  experimental  error,  the  experiment  must  be
repeated  in  order  to  draw  valid  conclusions.  Each  complete  repetition  of 
an  experiment  is  called  a  replication.  The  most  efficient  procedure  would 
be  to  (1)  estimate  the  experimental  error  before  the  experiment  on  the 
basis  of  previous  experience,  (2)  determine  the  number  of  replications 
accordingly,  and  (3)  perform  all  the  various  parts  of  the  total  experiment 
in  a  random  order. 

The  reliability  of  experimental  conclusions  can  also  be  increased  by 
refining  the  experimental  technique  (reducing  the  standard  deviation  of 
the  experimental  error).  Since  a  few  replications  of  a  refined  technique 
can  achieve  the  same  reliability  as  many  replications  of  a  coarse  technique, 
the  choice  of  method  in  a  particular  investigation  may  be  made  on  the  basis 
of  cost. 

These  general  principles  are  exemplified  in  later  sections.  However,  there 
are  many  designs  more  complex  than  those  given  in  this  Manual;  for  these, 
the  experimenter  should  consult  a  statistician  or  study  the  references  at  the 
end  of  this  chapter. 

The data obtained from an experiment involving several levels of one
or more factors are analyzed by the technique of analysis of variance.
This technique enables us to break down the variance of the measured
variable into the portions caused by the several factors, varied singly or
in combination, and a portion caused by experimental error. More precisely,
analysis of variance consists of (1) a partitioning of the total sum of
squares of deviations from the mean into two or more component sums of
squares, each of which is associated with a particular factor or with
experimental error, and (2) a parallel partitioning of the total number of
degrees of freedom.

Let us consider an experiment on the extrusion of propellant grains in
which we desire to determine the effects of extrusion rate and die
temperature on grain port area. Ambient temperature and humidity during
extrusion are background conditions that may affect the results. Values of
port area might be observed at several levels of each factor, at extrusion
rates of 15, 20, and 25 in/min and at die temperatures of 120 and 140°F.
Each combination of one level of extrusion rate with one level of die
temperature is called a treatment; for instance, an extrusion rate of
20 in/min with a die temperature of 140°F. (Some authors call this a
treatment combination rather than a treatment. The levels of each factor
then become the treatments.) If the background conditions were controlled
at each of several levels, each treatment would also include one level of
ambient temperature and one level of humidity. Alternatively, ambient
temperature and humidity might be (1) held constant (thus restricting the
range of the conclusions); (2) allowed to vary as usual, but with their
effects randomized by randomizing the order of treatments; or (3) recorded
during the experiment and later treated by analysis of covariance. (For
information on the analysis of covariance, see Ref. 5.5, 5.14, and 5.15.)

Analysis  of  variance,  analysis  of  covariance,  and  regression  analysis  are 
specializations  of  the  more  general  theory  of  testing  linear  hypotheses. 
(For  an  explanation  of  regression  analysis,  see  Chapter  6  of  this  Manual; 
for  the  theory  of  testing  linear  hypotheses,  see  Ref.  5.10,  5.11,  and  5.12.) 

5.1.  Common  Designs  for  Experiments

The  following  considerations  apply  generally  in  designing  experiments  or 
comparing  designs.  Some  specific  designs  are  treated  in  Sec.  5.1.1-5.1.4.

The precision of an experiment is measured in terms of σ_0, the standard
deviation (assumed common) of the populations of observations for each
treatment; in other words, the experimental error. (See Sec. 5.3.) If the
design is given m replications, so that each treatment is applied m times,
the standard deviation of the estimate of the difference between two
treatment effects is proportional to σ_0/√m. Thus, as the number of replications
increases,  the  error  decreases.  This  formula  applies  only  if  the  replications 
are  independent  and  the  experimental  material  remains  homogeneous  as 
the  experiment  increases  in  size.  Elaborations  of  such  error  considerations 
make  it  possible  to  determine  how  large  an  experiment  should  be  for 
detection  of  effects  of  a  prescribed  size,  somewhat  in  the  manner  that 
Appendix  Charts  V-X  can  be  used  in  simpler  experiments.  (See  Ref.  5.4 
and  5.10.) 

Designs  may  be  compared  on  the  basis  of  the  number  of  degrees  of 
freedom  for  estimating  the  standard  error  σ_0.  A  decrease  in  the  number
of  degrees  of  freedom  will  decrease  the  sensitivity  of  the  experiment.
Suppose  that  an  observed  variable,  compressive  strength  of  grain,  is  to  be 
tested  with  four  levels  each  of  three  factors:  temperature,  extrusion  rate, 
and  humidity.  Let  us  consider  the  following  methods: 

1.  We  may  use  a  full-scale  4  X  4  X  4  factorial  design  (Sec.  5.1.1)  with
64  treatments.  This  will  yield  27  degrees  of  freedom  for  estimating  the 
error  variance. 

2.  We  may  decide  to  include  only  two  of  the  three  factors  if  the  results 
obtained  from  these  are  to  be  applied  to  only  one  level  of  the  third  factor, 
or  if  the  four  levels  of  the  third  factor  are  expected  to  have  about  the 
same  effect  on  the  compressive  strength  of  the  grain.  (If  this  assumption 
is  not  justified,  the  standard  error  σ_0  will  be  increased  by  the  variation
among  the  levels  in  case  the  four  levels  of  the  third  factor  are  included 
by  randomization.  If  only  one  level  of  the  third  factor  is  included,  the 
results  will  apply  only  to  that  level.)  Using  a  4  X  4  factorial  design  with 
16  treatments  will  yield  nine  degrees  of  freedom  for  estimating  the  error 
variance  if  the  two  factors  are  independent;  i.e.,  if  their  interaction  can  be 
neglected. 

3.  We  may  replicate  the  4  X  4  design  of  paragraph  2  four  times
to  give  64  applications  of  the  16  treatments  (four  each).  This  procedure 
will  yield  48  degrees  of  freedom  for  estimating  the  error  variance.  Then 
the  standard  error  for  the  difference  between  two  treatment  effects  will  be 
the  same  as  in  paragraph  1  and  one-half  that  in  paragraph  2. 

4.  We  may  use  a  4  X  4  Latin-square  arrangement  (Sec.  5.1.4)  of  two 
factors,  with  the  four  levels  of  the  third  factor  superimposed  according 
to  a  randomly  selected  4X4  Latin  square.  This  design  assumes  that  all 
three  factors  are  independent  and  gives  information  on  all  three  with  a 
minimum  number  of  treatments — 16  in  the  example.  However,  there  are 
only  six  degrees  of  freedom  for  estimating  the  error  variance;  as  a  result, 
substantial  real  effects  of  the  factors  may  go  undetected  because  of  the 
large  random  fluctuation  which  must  be  allowed  for  error. 
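
The error degrees of freedom quoted for the four designs can be checked with a short calculation. The sketch below is illustrative only; the function names are hypothetical, and it assumes that in design 1 the three-factor interaction serves as the error term, that in design 2 the two-factor interaction serves as the error term, and that in design 3 the error comes from the variation among replications within treatments.

```python
from itertools import combinations
from math import prod


def main_effect_df(levels):
    """Degrees of freedom for all main effects: sum of (levels - 1)."""
    return sum(k - 1 for k in levels)


def two_factor_interaction_df(levels):
    """Degrees of freedom for all two-factor interactions."""
    return sum((a - 1) * (b - 1) for a, b in combinations(levels, 2))


def error_degrees_of_freedom():
    """Error degrees of freedom for the four designs discussed in the text."""
    designs = {}

    # 1. Unreplicated 4 x 4 x 4 factorial; the three-factor interaction
    #    is assumed to serve as error.
    levels = (4, 4, 4)
    designs["4x4x4 factorial"] = (
        prod(levels) - 1 - main_effect_df(levels) - two_factor_interaction_df(levels)
    )

    # 2. Unreplicated 4 x 4 factorial; the interaction is assumed negligible
    #    and serves as error.
    levels = (4, 4)
    designs["4x4 factorial"] = prod(levels) - 1 - main_effect_df(levels)

    # 3. The 4 x 4 factorial replicated four times; error comes from
    #    replications within the 16 treatments.
    replications = 4
    treatments = prod(levels)
    designs["4x4 factorial, 4 replications"] = (
        replications * treatments - 1 - (treatments - 1)
    )

    # 4. 4 x 4 Latin square: rows, columns, and letters each take k - 1.
    k = 4
    designs["4x4 Latin square"] = k * k - 1 - 3 * (k - 1)

    return designs


for name, df in error_degrees_of_freedom().items():
    print(f"{name}: {df} error degrees of freedom")
```

Under these assumptions the four values agree with those given in paragraphs 1 through 4.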

Which  of  the  above  designs  is  best  cannot  be  decided  categorically 
without  further  knowledge  of  the  conditions  to  be  met.  Under  various 
conceivable  circumstances  any  one  of  these  designs,  or  some  other  design, 
might be chosen. Whatever the bases for decision, it is essential that a
systematic method be used for selecting and carrying out the design. The
following  list  provides  a  useful  general  routine  for  designing  and  running 
an  experiment. 

1. State the objectives of the experiment.

a.  Use  information  from  previous  experiments. 

b. Choose experimental conditions to represent the conditions under
which  the  results  will  be  applied.  This  may  be  done  by  either 
systematic  choice  or  randomization. 

c.  State  the  precise  hypotheses  that  are  to  be  tested. 

2.  Draw  up  a  preliminary  design. 

a.  Take  into  account  the  experimental  error,  and  the  number  of 
degrees  of  freedom  for  estimating  it,  provided  by  each  proposed 
design. 

b.  Consider  the  cost  of  experimentation  versus  the  cost  of  wrong 
decisions. 

3. Review the design with all collaborators.

a. Reach an understanding as to what decisions hinge on each outcome.
Keep notes.

b.  Encourage  collaborators  to  anticipate  all  factors  that  might  affect 
the  results. 

c.  Discuss  the  experimental  techniques  in  sufficient  detail  to  discover 
any  procedures  that  might  lead  to  bias. 

4.  Draw  up  the  final  design. 

a.  Present  the  design  in  clear  terms  to  assure  that  its  provisions  can  be 
followed  without  confusion. 

b.  Include  the  methods  of  analysis  as  part  of  the  design,  ascertaining 
that  conditions  necessary  for  the  validity  of  these  methods  will  be 
met. 

5.  Carry  out  the  experiment. 

a.  During  the  course  of  the  experiment,  maintain  communication 
among  all  collaborators,  so  that  questions  arising  from  unforeseen 
experimental  conditions  or  results  may  be  answered  in  keeping 
with  the  design  agreed  upon. 

6.  Analyze  the  data. 

a. Follow the methods outlined in the final design (Step 4).

7.  Write  a  report. 

a. Present the data and results in clear tables and graphs.

b. Compare the results with the stated objectives of the experiment
(Step 1).

c.  If  the  results  suggest  further  experimentation,  outline  the  course 
that  such  experimentation  should  take. 


5.1.1.  Complete  Factorial  Design  for  Treatments 

The  complete  factorial  design  for  arranging  the  treatments  to  be  tested 
calls  for  the  use  of  every  combination  of  the  different  levels  of  the  factors. 
Applying  the  complete  factorial  design  to  the  example  in  Sec.  5.0,  the 
number  of  treatments  would  be  3  X  2  =  6,  as  shown  in  Table  5.1. 


TABLE 5.1. Treatments for Complete Factorial
Design of Propellant-Grain Experiment

Treatment    Extrusion rate, in/min    Die temperature, °F
    1                  15                      120
    2                  15                      140
    3                  20                      120
    4                  20                      140
    5                  25                      120
    6                  25                      140

Note  that  in  this  design  both  factors,  extrusion  rate  and  die  temperature, 
are investigated simultaneously, whereas in the classical method all variables
but one remain fixed. The advantages of complete multiple factorial
design  are  (1)  the  increased  generality  of  the  conclusions  for  each  factor, 
which  may  apply  to  several  levels  of  the  other  factors  rather  than  to  one 
fixed  level;  and  (2)  the  possibility  of  checking  for  interactions  (i.e.,  effects 
produced  by  two  or  more  factors  jointly  that  go  beyond  the  total  of  their 
individual  effects).  Examples  of  interaction  are  common  in  chemistry:  the 
properties  of  a  mixture  may  differ  widely  from  the  properties  of  the 
components. 

To compare only two treatments (an extrusion rate of 15 in/min and
a die temperature of 120°F with an extrusion rate of 20 in/min and a die
temperature of 120°F) and to estimate the experimental error by the
variability within each treatment, we could use the t test (Sec. 2.5.3b).
Also, all six treatments mentioned above could be compared pairwise in
(6 × 5)/2 = 15 t tests, but these tests would not be independent. More
information  can  be  extracted  from  the  data  if  we  study  all  the  treatments 
together. 
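
The treatment list of Table 5.1 and the count of pairwise comparisons can be generated mechanically. The following is a minimal sketch; the variable names are illustrative and not part of the Manual.

```python
from itertools import combinations, product

extrusion_rates = [15, 20, 25]   # in/min
die_temperatures = [120, 140]    # degrees F

# Complete factorial: every combination of the levels of the two factors.
treatments = list(product(extrusion_rates, die_temperatures))

# Every unordered pair of treatments that a pairwise t test would compare.
pairwise_comparisons = list(combinations(treatments, 2))

for number, (rate, temperature) in enumerate(treatments, start=1):
    print(f"Treatment {number}: {rate} in/min, {temperature} F")
print(f"{len(pairwise_comparisons)} pairwise comparisons")
```

The same two calls extend to any number of factors and levels, which shows how quickly the number of pairwise tests grows with the number of treatments.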

5.1.2.  Complete  Randomization  for  Background  Conditions 

Suppose  we  wish  to  test  the  effects  of  the  use  of  Friden,  Marchant,  and 
Monroe  desk  computers  (three  treatments)  on  the  time  it  takes  to  compute 
sums  of  squares  of  12  four-digit  numbers,  and  to  obtain  conclusions 
unaffected  by  the  variability  among  operators. 

Several  background  conditions,  though  not  of  prime  importance  in  the 
experiment at hand, are likely to influence the results. If only one operator
is used, he may have a natural preference for one of the machines,
so that the experimental results will apply only to him. If several operators
are used, one may be so much more skilled than the others that his computer
will appear best even if it is not. The order of the trials may make a
difference. Each operator may become more relaxed during the test and
his speed may consistently increase as the test progresses, or he may tire
and his speed may consistently decrease. If he uses the same twelve numbers
in each trial, his proficiency will almost certainly increase.

Some  of  these  conditions  may  be  varied  systematically.  For  instance,  we 
can  arrange  the  experiment  so  that  each  operator  will  use  each  machine  for 
the same number of trials. Other conditions, such as the order of trials,
can be randomized by flipping coins or by drawing numbered chips from
a bowl. This procedure may still favor some particular machine (by placing
it last, say) unless the entire experiment is repeated several times to yield
several  replications.  Familiarity  with  the  same  numbers  can  be  avoided  by 
drawing  anew  12  four-digit  numbers  from  a  table  of  random  numbers  for 
each  operator-machine  combination. 
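
The randomization described above can be sketched with a pseudorandom generator standing in for the coins, chips, and table of random numbers. This is an illustration under stated assumptions, not a prescribed procedure: the function name `plan_trials`, the operator labels, and the choice of drawing each four-digit number from 1000 through 9999 are all hypothetical, and fresh numbers are drawn here for every trial.

```python
import random


def plan_trials(operators, machines, replications, seed=None):
    """Build a trial plan in which every operator uses every machine once
    per replication, in a freshly randomized order each time."""
    rng = random.Random(seed)
    plan = []
    for replication in range(1, replications + 1):
        for operator in operators:
            order = list(machines)
            rng.shuffle(order)  # random order of machines for this operator
            for position, machine in enumerate(order, start=1):
                # A fresh set of 12 four-digit numbers for each trial, so that
                # familiarity with the numbers cannot build up.
                numbers = [rng.randint(1000, 9999) for _ in range(12)]
                plan.append({
                    "replication": replication,
                    "operator": operator,
                    "position": position,
                    "machine": machine,
                    "numbers": numbers,
                })
    return plan


operators = ["operator 1", "operator 2", "operator 3"]
machines = ["Friden", "Marchant", "Monroe"]
plan = plan_trials(operators, machines, replications=2, seed=1)
for trial in plan[:3]:
    print(trial["replication"], trial["operator"], trial["position"], trial["machine"])
```

Fixing the seed makes the plan reproducible, which helps when the procedure has to be recorded before the experiment begins.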

Successful  experiments  depend  heavily  on  ingenuity  in  spotting  possible 
sources  of  bias  and  eliminating  them  by  a  well-organized  plan.  Before  the 
experiment,  we  should  decide  upon  and  record  the  procedure  that  will  be 
followed  for  every  contingency  that  may  arise  during  the  experiment,  and 
the  conclusion  that  will  be  drawn  from  every  possible  outcome.  In  the 
computing experiment, for instance, we should decide on the procedure
in case one of the sums is computed incorrectly, or if a claim is presented
later  that  one  of  the  machines  was  out  of  adjustment. 

Complete  randomization  is  recommended  for  small  experiments  on 
material known to be homogeneous and, because of ease in analysis compared
with other designs, for any experiment likely to contain failures that
must  be  omitted  from  the  analysis. 

5.1.3.  Randomized-Block  Design  for  Background  Conditions 

In  larger  experiments  the  background  conditions  may  lack  homogeneity, 
so  that  complete  randomization  of  the  whole  background  at  once  would 
introduce  an  unnecessary  amount  of  error.  In  this  case,  we  may  decide  to 
randomize  one  block  of  the  background  at  a  time,  keeping  each  block 
intact  so  that  its  effect  will  appear  explicitly  in  the  analysis. 

In  the  test  of  desk  computers  described  above,  considerable  variation 
might  occur  if  an  operator  were  trained  on  a  particular  machine,  or  had 
used  it  the  most,  or  preferred  it.  Restricting  the  test  to  operators  trained 
on  Fridens  would  limit  our  conclusions  to  such  operators.  Randomizing 
the  selection  of  operators  according  to  their  training  and  analyzing  the 
total  data  would  not  ascertain  whether  the  training  of  the  operator  had 
affected the results; if there was such an effect it would appear as experimental
error. We might prefer to consider the operators trained on the
three different machines in three separate blocks, preferably of equal size.
Within each block we would still randomize the order of use of the computers.
At the end of the experiment we could assess the effect of the type
of  training  on  computing  speed. 

5.1.4.  Latin-Square  Design  for  Background  Conditions 

To  test  k  treatments  under  k  choices  for  one  background  condition  and 
k  choices  for  a  second,  independent  background  condition,  we  can  use  a 
Latin-square  design;  i.e.,  a  square  array  of  letters  such  that  each  letter 
appears exactly once in each row and exactly once in each column. (For
further  material  on  Latin  squares,  see  Ref.  5.9;  for  examples  of  Latin 
squares  of  various  sizes,  see  Ref.  5.7.) 
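
The defining property of a Latin square is easy to check by machine, and a random square can be produced by shuffling the rows, columns, and letters of a cyclic square. The sketch below is illustrative; the function names are hypothetical, and the shuffling method does not draw with equal chance from all Latin squares of a given size, so tabulated squares may still be preferred for formal work.

```python
import random


def is_latin_square(square):
    """True if every symbol appears exactly once in each row and each column."""
    k = len(square)
    symbols = set(square[0])
    if len(symbols) != k:
        return False
    if any(len(row) != k or set(row) != symbols for row in square):
        return False
    return all({row[j] for row in square} == symbols for j in range(k))


def random_latin_square(letters, seed=None):
    """Shuffle the rows, columns, and letters of the cyclic k x k square."""
    rng = random.Random(seed)
    k = len(letters)
    rows, cols, symbols = list(range(k)), list(range(k)), list(letters)
    rng.shuffle(rows)
    rng.shuffle(cols)
    rng.shuffle(symbols)
    return [[symbols[(r + c) % k] for c in cols] for r in rows]


square = random_latin_square("ABCD", seed=7)
for row in square:
    print(" ".join(row))
print("valid:", is_latin_square(square))
```

A square in which any letter repeats within a row or column fails the check and should not be used as a design.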

Suppose we wish to test the effect of office noise level on the speed of
computing, taking into account as background conditions the desk computer
used and the operator making the trial. We might choose four supposedly
like computers at random from the entire production of a given model and
four operators at random from a large office staff. The trials might then be
arranged in a Latin square as in Table 5.2, operator 1 computing on
machine 3 at noise level D, for instance. The order of the trials could be
randomized to take care of the learning factor. Only 4 × 4 = 16 tests are
required. If more than 16 tests are desired, further randomly selected Latin
squares of the same size may be used.


TABLE 5.2. A 4 × 4 Latin Square for Noise-Level Experiment

The letters A, B, C, and D represent levels of noise, the volume increasing from
A to D.

Background            Background condition I (computer)
condition II
(operator)            1        2        3        4
     1                B        C        D        A
     2                D        B        A        C
     3                C        A        B        D
     4                A        D        C        B

Advantages of the Latin-square design are economy of samples and ready
analysis. The main disadvantage is inflexibility, since the same number of
choices is required for each variable and the number of degrees of freedom
is thereby determined also.

The restriction that the two background conditions and the factor to be
tested be independent (no interactions) is sometimes a hard one to meet.
For instance, in the noise-level experiment it must be assumed that the
level of noise has the same effect on the speed of computing, regardless of
the operator. If one operator is particularly bothered by noise or another
operator is unusually oblivious to noise while he computes, the conclusions
of the experiment will be made unreliable.

5.1.5. Other Designs

Generalizations of these designs, such as the Greco-Latin square (k
levels of each of three background conditions), incomplete block designs,
and designs using "confounding," including those using fractional replication,
are available to meet specific needs (Ref. 5.4, 5.10, and 5.15). For the
analysis of these and other basic designs, see Ref. 5.10.

Most designs have symmetries that make their analysis simple. However,
designs that are unsymmetrical (because data are missing, for example) may
be analyzed according to the general theory of testing linear hypotheses
(Ref. 5.11). This theory has become somewhat easier to apply since the
advent of high-speed electronic computing machines.

5.2. Types of Analysis of Variance

There are two mathematical models to which the analysis of variance
applies: a model for Type I problems (Sec. 5.2.1) and a model for Type
II problems (Sec. 5.2.2). This Manual does not treat the more complicated
case of "mixed" models; material on this subject may be found in Ref.
5.1 and 5.15.

5.2.1. Type I Problems

When particular levels of a factor are selected purposely for study
because they are the only levels of interest or are considered the most
promising, the analysis of variance becomes a comparison of the mean
effects of those particular levels. Statistical tests (F tests) are made to see
whether the observed differences in mean effects are real or random. If
the differences are real, the population constants or parameters (main effects
and interactions) may be estimated easily as averages. In these Type I
problems, the analysis of variance refers to the finite number of treatments
actually chosen, and the variation is called systematic. The problems
treated in most elementary texts are of this type (Ref. 5.5). The experiments
on desk computers described in Sec. 5.1.2 and 5.1.3 involve the
systematic variation of machines from Friden to Marchant to Monroe.

5.2.2. Type II Problems

When levels of each factor are drawn at random, the analysis of variance
is concerned not so much with the particular levels appearing in the experiment
as with the larger finite or infinite population of levels that had an
equal chance of being drawn. The analysis of variance provides statistical
tests (F tests) to see whether the random variation of levels of each factor
actually contributes variation to the variable under study. These tests for
Type II problems are of the same form as those used for Type I problems;
however, the objective in Type II problems is the estimation of the component
of variance contributed by each factor to the total variance of the
variable under study, rather than the estimation of the main effects and
interactions at particular levels of each factor, as in Type I problems (Ref.
5.6).

In  the  noise-level  experiment  of  Sec.  5.1.4  we  drew  four  computers  at 
random  from  the  total  production  of  one  model  and  four  operators  at 
random  from  a  large  office  staff.  Here,  then,  conclusions  as  to  the  effect  of 
noise  level  can  be  drawn  for  all  computers  of  that  model  and  for  all 
operators  on  the  staff.  The  experiment  will  also  show  the  variability  of 
computers  and  the  variability  of  operators  in  their  effects  on  speed  of 
computing. 

5.3.  Assumptions  of  Analysis  of  Variance 

(1) Observations are random with respect to any conditions not systematically
varied or removed from the analysis by covariance methods.

(2) Means and variances are additive. In the Type I model we look upon
each observation ($x_{ijt}$, for example, which receives factor 1 at level $i$ and
factor 2 at level $j$) as composed of the separate parts

$$x_{ijt} = \mu + \alpha_i + \beta_j + e_{ijt}$$

where

$\mu$ = over-all mean

$\alpha_i$ = mean effect of factor 1 at level $i$

$\beta_j$ = mean effect of factor 2 at level $j$

$e_{ijt}$ = random deviation from the mean position of the $t$th item receiving
the treatment $ij$. The $e_{ijt}$ are assumed to have population means
of zero.

In the Type I model $\sum_i \alpha_i = \sum_j \beta_j = 0$.

In the Type II model we again take the observation $x_{ijt}$ as a sum of the
terms given above, but now every term except $\mu$ is random, so that the
variance of a random observation $x_{ijt}$ is the sum of a variance component
caused by factor 1, a variance component caused by factor 2, and an error
variance not caused by either factor. In many problems the variation caused
by several factors is greater than the sum of their independent effects, so that
other joint effects (interactions) must be added, as in Sec. 5.5.2.

(3) Experimental errors $e_{ijt}$ are independent.

(4) Variances of the experimental errors $e_{ijt}$ for all pairs $i$, $j$ are equal,
with the common value $\sigma_0^2$.

(5)  Distribution  of  experimental  errors  is  normal. 

To  estimate  mean  effects  or  components  of  variance,  we  need  only  the 
first four assumptions. To make significance tests of the means or components,
or to determine how precise (or good) our estimates are, we must
satisfy  all  five  assumptions.  If  certain  of  the  assumptions  are  not  met,  the 
analysis  of  variance  provides  only  an  approximate  procedure;  however,  this 
approximation  is  usually  the  best  available.  (See  Ref.  5.3  for  a  discussion  of 
the  effects  of  deviations  from  the  assumptions,  and  Ref.  5.2  for  methods 
of  inducing  normality.) 

5.4.  One-Factor  Analysis 

Table  5.3  shows  in  general  form  the  tabulation  that  would  result  from 
an  experiment  to  compare  r  different  treatments,  levels  of  a  single  factor. 
Each  treatment  has  been  used  (replicated)  m  times.  There  is  some  random 
variation  among  the  readings  for  any  one  treatment,  but  it  is  the  treatment 
sample  means  that  are  compared.  The  difference  among  the  means  is 
not  considered  significant  unless  it  is  large  compared  with  the  random 
within-treatments  variation. 

Throughout  Sec.  5.4.1  and  5.4.2  the  reader  may  find  it  helpful  to  follow 
the  examples  given  in  Sec.  5.4.2  and  to  compare  Table  5.5  with  Table  5.3 
and  Tables  5.6  and  5.7  with  Table  5.4. 

5.4.1.  Assumptions 

For  general  assumptions  of  the  analysis  of  variance,  see  Sec.  5.3.  The 
assumptions  for  one-factor  analysis  follow. 

(1) The $m$ replications $x_{i1}, x_{i2}, \ldots, x_{im}$ of the experiment for any
one treatment $i$ represent a sample drawn at random from a normal population
(see Table 5.3). Each observation $x_{it}$ can be written as

$$x_{it} = \mu + \alpha_i + e_{it} \qquad (i = 1, 2, \ldots, r;\ t = 1, 2, \ldots, m)$$

where

$\mu$ = over-all mean

$\alpha_i$ = mean effect of the $i$th treatment (or row)

$e_{it}$ = random deviation from the mean position of the $t$th item receiving
treatment $i$. The $e_{it}$ have independent normal distributions with
mean zero and common variance $\sigma_0^2$.

In the Type I model, $\sum_i \alpha_i = 0$.

In the Type II model we assume that the variance of the population of
row effects is $\sigma_\alpha^2$.

(2)  For  a  Type  I  problem,  the  r  treatments  are  chosen  systematically  by 
the  experimenter  as  important  levels  of  the  factor.  For  a  Type  II  problem, 
on  the  contrary,  we  assume  that  the  r  levels  of  the  factor  are  chosen  at 
random  from  a  normal  population  of  levels  of  the  factor. 


TABLE 5.3. One-Way Classification

                                  Replications
Treatment    1           2           ...    t           ...    m
    1        $x_{11}$    $x_{12}$    ...    $x_{1t}$    ...    $x_{1m}$
    2        $x_{21}$    $x_{22}$    ...    $x_{2t}$    ...    $x_{2m}$
    .           .           .                  .                  .
    i        $x_{i1}$    $x_{i2}$    ...    $x_{it}$    ...    $x_{im}$
    .           .           .                  .                  .
    r        $x_{r1}$    $x_{r2}$    ...    $x_{rt}$    ...    $x_{rm}$

$r$ = number of tested levels of the factor (number of rows)

$m$ = number of replications of each treatment (number of measurements per
treatment)

$n = rm$ = total number of observations

$x_{it}$ = replication $t$ of treatment $i$ (trial $t$ in row $i$)

$\bar{x}_{i.} = \frac{1}{m} \sum_{t=1}^{m} x_{it}$ = $i$th treatment mean (mean of the $i$th row)

$\bar{x} = \frac{1}{r} \sum_{i=1}^{r} \bar{x}_{i.} = \frac{1}{n} \sum_{i,t} x_{it}$ = grand mean

5.4.2. Analysis

The sum of the squared deviations of all $n = rm$ observations from the
grand mean $\bar{x}$ can be written as $m$ times the sum of squared deviations of
the $r$ treatment means from the grand mean plus the sum of squared
deviations of individual observations from their respective treatment means:

$$\sum_{i=1}^{r} \sum_{t=1}^{m} (x_{it} - \bar{x})^2 = m \sum_{i=1}^{r} (\bar{x}_{i.} - \bar{x})^2 + \sum_{i=1}^{r} \sum_{t=1}^{m} (x_{it} - \bar{x}_{i.})^2$$

In the case of a Type I problem we make the null hypothesis that the
means of the normal populations of observations $x_{it}$ for the $r$ treatments
are all equal; that is, that the $\alpha_i$ are all zero. If the null hypothesis is true,
we have three unbiased estimates of the common variance $\sigma_0^2$ of the $r$ populations,
obtained by dividing the sums of squares in the equation above
by their respective degrees of freedom. The estimates are

$$s_1^2 = \frac{1}{n - 1} \sum_{i} \sum_{t} (x_{it} - \bar{x})^2$$

$$s_2^2 = \frac{m}{r - 1} \sum_{i} (\bar{x}_{i.} - \bar{x})^2$$

$$s_3^2 = \frac{1}{n - r} \sum_{i} \sum_{t} (x_{it} - \bar{x}_{i.})^2$$

(but the algebraically equivalent computing forms in Table 5.4 are preferable
for numerical work).
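
The decomposition of the total sum of squares can be verified numerically. The sketch below uses a small table of made-up readings (three treatments, three replications each) purely for illustration; the function name and the data are hypothetical and do not come from the Manual.

```python
def sums_of_squares(table):
    """Return (total, among, within) sums of squared deviations for a
    one-way table with one row per treatment and equal replication."""
    r = len(table)
    m = len(table[0])
    n = r * m
    grand_mean = sum(sum(row) for row in table) / n
    treatment_means = [sum(row) / m for row in table]

    total = sum((x - grand_mean) ** 2 for row in table for x in row)
    among = m * sum((mean - grand_mean) ** 2 for mean in treatment_means)
    within = sum(
        (x - mean) ** 2
        for row, mean in zip(table, treatment_means)
        for x in row
    )
    return total, among, within


# Hypothetical readings, chosen only to keep the arithmetic simple.
toy = [[3, 5, 4], [8, 7, 9], [5, 6, 4]]
total, among, within = sums_of_squares(toy)
r, m = len(toy), len(toy[0])
n = r * m

print("total  =", total, " among + within =", among + within)
print("mean squares:", total / (n - 1), among / (r - 1), within / (n - r))
```

For these readings the among-treatments and within-treatments sums of squares add up to the total, as the identity requires, and dividing each by its degrees of freedom gives the three mean squares.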


TABLE 5.4. Analysis of Variance for One-Way Classification

Source of            Sum of             Degrees of    Mean                     Expected
variation            squares            freedom       square                   mean square

Among treatments     (2)                $r - 1$       $s_2^2 = (2)/(r - 1)$    $\sigma_0^2 + m\sigma_\alpha^2$
Within treatments    (3) = (1) - (2)    $n - r$       $s_3^2 = (3)/(n - r)$    $\sigma_0^2$
Total                (1)                $n - 1$

where

$$(1) = \frac{n \sum_{i,t} x_{it}^2 - \left(\sum_{i,t} x_{it}\right)^2}{n} \qquad (2) = \frac{r \sum_{i} \left(\sum_{t} x_{it}\right)^2 - \left(\sum_{i,t} x_{it}\right)^2}{n}$$

If the null hypothesis of equal means is true, the last two quantities
should be about equal (except for sampling error) with a ratio near one,
for they represent independent estimates of $\sigma_0^2$. We divide the second by
the third, and reject the null hypothesis at the significance level $\alpha$ if the
quotient F exceeds the critical F value from Appendix Table 5 for
$f_1 = r - 1$ and $f_2 = n - r$ degrees of freedom. A value for F greater than
or equal to the tabled value $F_\alpha(r - 1, n - r)$ would occur in random
sampling with probability only $\alpha$ if the null hypothesis were true. For the
F tests in this chapter, we are interested in only one tail of the distribution,
since the variability we want to detect can only tend to increase the
among-treatments variance, never to decrease it.

If the null hypothesis is false, only $s_3^2$ of the three mean squares is an
unbiased estimate of $\sigma_0^2$. In this case, the $\bar{x}_{i.}$ are the best estimates of the
different population means $\mu + \alpha_i$.

If the quotient F does not exceed $F_\alpha(r - 1, n - r)$, the data are consistent
with the null hypothesis that the means are all equal. In this case
the sample grand mean $\bar{x}$ is taken as the best estimate of the common
population mean $\mu$. The calculations of the three estimates of $\sigma_0^2$ can be
made conveniently as shown in Table 5.4. The last column of the table is
appropriate only in Type II problems.

As is indicated by comparison of the last two columns, $s_3^2$ estimates the
variance $\sigma_0^2$ caused by variability within the individual treatments; and if
the treatments represent a random selection from the population of levels
taken on by the factor (Type II problem), the quantity

$$s_\alpha^2 = \frac{s_2^2 - s_3^2}{m}$$

estimates the variance $\sigma_\alpha^2$ caused by variability among the treatments
(variability among rows). Notice that the estimate $s_2^2$ obtained from considering
the dispersion of the $r$ treatment means includes some fluctuation caused
by within-treatment variability, which the operation expressed by the
equation above removes, in accordance with the last column of the table.

The  expressions  in  the  second  column  of  the  table  (the  sums  of  squares) 
have  been  abbreviated  to  (1),  (2),  and  (3)  for  convenience. 

The estimate $s_1^2 = (1)/(n - 1)$ of the variance could be formed also, but
is dependent on (2) and (3), which answer directly the usual questions of
interest.

Unequal sample sizes for the different treatments are allowable, but
not recommended. The analysis is the same except for the formula

$$(2) = \sum_{i=1}^{r} \frac{\left(\sum_{t} x_{it}\right)^2}{m_i} - \frac{\left(\sum_{i,t} x_{it}\right)^2}{n}$$

where $m_i$ is the number of observations for treatment $i$ and $n$ is the total
number of observations, $\sum_{i=1}^{r} m_i$.


The  following  example  could  be  either  Type  I  or  Type  II,  according 
to  the  circumstances.  The  analysis  is  in  great  part  the  same  for  both,  and 
will  be  carried  out  for  both  types. 


Example (Type I Problem). The data of Table 5.5 (a specific case of
the tabular form shown in Table 5.3) represent coded readings on widths
of propellant carpet rolls from 10 production lots, with 12 samples
selected randomly from each lot. The factor to be considered for its
effect on width of roll is the lot. The $r = 10$ levels of the factor are the
treatments, each of which received $m = 12$ replications. The lots show
different mean sample widths. Is this difference large enough compared
with the variability within lots to indicate a real difference among lots?


TABLE 5.5. One-Way Classification of Propellant
Carpet-Roll Data

                             Samples
Lot     1    2    3    4    5    6    7    8    9   10   11   12
  1    40   30   38   23   35   38   52   59   28   37   44   54
  2    52   44   47    3   31    2    2   50   59   32    2   60
  3    59    5    3    5   44    8   46    0   19   19   15   29
  4    44   12   23   21   32    6   40    3   52   32   29   44
  5    50   35   50   25    4   50   40    3   32   28   22   54
  6    20   55   31   30   40   40   42   46   30   52   15   46
  7    19   58   40    8   54   38   56   45   50   24   21    0
  8    57    7   46   33    4   19    7   38   47   52   24   42
  9    10   42   24    9   54   35   18   16   50   48   38   15
 10    49    7   43   33   39    2    3   10   25    6    3   38

Note: $r = 10$ treatments; $m = 12$ replications; $n = 120$ observations;
$\bar{x} = 30.8$.


TABLE 5.6. Calculations on Carpet-Roll Data

In using a machine computing technique, only the total, not the separate
entries, for the column of squared sums would be tabulated.

$$(3) = (1) - (2) = \frac{4{,}544{,}876 - 478{,}156}{120} = 33{,}889.333$$


For a Type I problem, the 10 lots of carpet rolls in Table 5.5 are
assumed to be lots of particular interest specified by the experimenter,
say the first 10 lots produced. We assume (Sec. 5.4.1) that the widths
for each lot are distributed normally with variance $\sigma_0^2$.

From the data of Table 5.5 we form the sums of Table 5.6. (In using
a machine computing technique, only the total, not the separate entries,
for the column of squared sums would be tabulated. The separate squares
are shown to accustom the reader to the summation notation.) Table 5.7
is our analysis-of-variance table.

To test the null hypothesis that the 10 treatment means are equal, we
compute the quotient F of the among-treatments mean square by the
within-treatments mean square. To make the test at the 5% significance
level, we compare

$$F = \frac{442.737}{308.085} = 1.437$$

with $F_\alpha(r - 1, n - r) = F_{.05}(9, 110)$. From Appendix Table 5 we find
that $F_{.05}(9, 110)$ is at least 1.96. Since 1.437 is less than $F_{.05}(9, 110)$, we
cannot reject the null hypothesis that the lot means are equal; in other
words, the data are consistent with the null hypothesis.
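
The Type I calculation can be retraced with the computing forms of Table 5.4. The sketch below is an illustration, not the Manual's own procedure. It assumes the Table 5.5 readings as transcribed in the code, with the one illegible entry (lot 10, sample 11) taken as 3, the value that reproduces the totals 4,544,876 and 478,156 quoted with Table 5.6. The function name is hypothetical, and the critical value is the lower bound 1.96 quoted in the text rather than a value computed from the F distribution.

```python
# Coded widths from Table 5.5: one row per lot, 12 samples per lot.
# Assumption: the illegible entry (lot 10, sample 11) is taken as 3.
WIDTHS = [
    [40, 30, 38, 23, 35, 38, 52, 59, 28, 37, 44, 54],
    [52, 44, 47, 3, 31, 2, 2, 50, 59, 32, 2, 60],
    [59, 5, 3, 5, 44, 8, 46, 0, 19, 19, 15, 29],
    [44, 12, 23, 21, 32, 6, 40, 3, 52, 32, 29, 44],
    [50, 35, 50, 25, 4, 50, 40, 3, 32, 28, 22, 54],
    [20, 55, 31, 30, 40, 40, 42, 46, 30, 52, 15, 46],
    [19, 58, 40, 8, 54, 38, 56, 45, 50, 24, 21, 0],
    [57, 7, 46, 33, 4, 19, 7, 38, 47, 52, 24, 42],
    [10, 42, 24, 9, 54, 35, 18, 16, 50, 48, 38, 15],
    [49, 7, 43, 33, 39, 2, 3, 10, 25, 6, 3, 38],
]


def one_way_anova(table):
    """One-way analysis of variance for equal replication, using the
    computing forms (1), (2), and (3) = (1) - (2)."""
    r = len(table)
    m = len(table[0])
    n = r * m
    grand_total = sum(sum(row) for row in table)
    sum_of_squares = sum(x * x for row in table for x in row)
    squared_row_totals = sum(sum(row) ** 2 for row in table)

    numerator_total = n * sum_of_squares - grand_total ** 2       # n * (1)
    numerator_among = r * squared_row_totals - grand_total ** 2   # n * (2)
    ss_total = numerator_total / n
    ss_among = numerator_among / n
    ss_within = ss_total - ss_among
    ms_among = ss_among / (r - 1)
    ms_within = ss_within / (n - r)
    return {
        "numerator_total": numerator_total,
        "numerator_among": numerator_among,
        "ss_total": ss_total,
        "ss_among": ss_among,
        "ss_within": ss_within,
        "ms_among": ms_among,
        "ms_within": ms_within,
        "F": ms_among / ms_within,
    }


result = one_way_anova(WIDTHS)
F_CRITICAL = 1.96  # lower bound for F.05(9, 110) quoted in the text

print(f"among:  SS = {result['ss_among']:.3f}, MS = {result['ms_among']:.3f}")
print(f"within: SS = {result['ss_within']:.3f}, MS = {result['ms_within']:.3f}")
print(f"F = {result['F']:.3f}; reject equal means: {result['F'] > F_CRITICAL}")
```

Working with the integer numerators before dividing by n mirrors the desk-calculator routine and avoids rounding until the last step.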

Analysis  of  variance  in  the  example  above  is  a  way  of  determining 
whether  the  production  process  is  in  statistical  control;  a  control  chart  on 
means  answers  the  same  question  more  simply  and  rapidly,  though  more 
roughly  (see  Chapter  7). 

TABLE 5.7. Analysis of Variance for Carpet-Roll Data

Source of variation    Sum of squares    Degrees of freedom    Mean square
Among treatments          3,984.633               9               442.737
Within treatments        33,889.333             110               308.085
Total                    37,873.967             119

Example (Type II Problem). In the preceding example, the 10 lots
were specified by the experimenter. They could have been any 10 specific
lots of interest to him.

Suppose that 10 lots are not specified, but selected at random from a
population of lots whose mean roll widths are distributed with variance
$\sigma_\alpha^2$, and that the widths within each lot have a normal distribution with
variance $\sigma_0^2$. Now we have a Type II problem and we may extend the
analysis to include components of variance. Our object is to estimate the
between-lots variance $\sigma_\alpha^2$ (by $s_\alpha^2$), and the within-lots variance $\sigma_0^2$ (by $s_3^2$).

Using Tables 5.4 and 5.7, we find the estimates

$$s_3^2 = 308.085 \quad \text{and} \quad m s_\alpha^2 + s_3^2 = 442.737$$

Then

$$s_\alpha^2 = \frac{442.737 - 308.085}{12} = 11.221$$

The estimated standard deviation caused by variability among all lots
of the population is $\sqrt{11.221} = 3.350$. The estimated standard deviation
caused by variability within the individual lots is $\sqrt{308.085} = 17.552$.
The estimated total variance in carpet-roll widths in the population of
lots is

$$s_\alpha^2 + s_3^2 = 11.221 + 308.085 = 319.306$$
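
The component estimates of the Type II example follow directly from the two mean squares of Table 5.7. A minimal sketch of the arithmetic, with illustrative variable names:

```python
from math import sqrt

ms_among = 442.737    # among-treatments mean square (Table 5.7)
ms_within = 308.085   # within-treatments mean square (Table 5.7)
m = 12                # replications (samples) per lot

# The among-lots mean square estimates (within variance) + m * (between variance),
# so subtracting the within mean square and dividing by m isolates the
# between-lots component.
between_lots_variance = (ms_among - ms_within) / m
within_lots_variance = ms_within
total_variance = between_lots_variance + within_lots_variance

print(f"between-lots variance  = {between_lots_variance:.3f}")
print(f"between-lots std. dev. = {sqrt(between_lots_variance):.3f}")
print(f"within-lots std. dev.  = {sqrt(within_lots_variance):.3f}")
print(f"total variance         = {total_variance:.3f}")
```

If the among-treatments mean square were smaller than the within-treatments mean square, this subtraction would give a negative estimate; the text does not cover that case, and the sketch makes no attempt to handle it.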

5.5. Two-Factor Analysis

5.5.1.  Two-Factor  Analysis  Without  Replication 

To investigate $r$ levels of factor 1 and $c$ levels of factor 2, we may take
the $n = rc$ observations shown in Table 5.8. We have the underlying
assumption that each observation may be written

$$x_{ij} = \mu + \alpha_i + \beta_j + e_{ij} \qquad (i = 1, 2, \ldots, r;\ j = 1, 2, \ldots, c)$$

with

$$\sum_i \alpha_i = \sum_j \beta_j = 0$$

in the Type I model; i.e., it is the sum of an over-all mean, a row effect, a
column effect, and a random error. It is also assumed that the errors are
independently and normally distributed, with mean zero and common
variance $\sigma_0^2$.

Only one observation is taken for each treatment; i.e., the experiment is
not replicated. There are n = rc treatments: each level of factor 1 is used
once with each level of factor 2. The typical analysis-of-variance table is
shown in Table 5.9. The last column, indicating the components of variance
$\sigma_r^2$, $\sigma_c^2$, and $\sigma_0^2$, applies only to Type II problems. We notice that, in
addition to the inclusion of the among-columns values, Table 5.9 differs
from Table 5.4 in that it gives "residual" values instead of within-treatments
values. Since here each of the rc treatments receives only one
trial, we do not have a direct estimate of the variance of a whole population
of observations of that treatment. However, under the assumptions,
this variance is $\sigma_0^2$ and can be estimated by the residual mean square. The
row or column mean square provides an estimate of $\sigma_0^2$ only if the corresponding
null hypothesis that every $\alpha_i = 0$, or that every $\beta_j = 0$, is true.
Hence, to test the null hypotheses, we compare the row and column mean
squares with the residual mean square.

To test at the significance level 0.05, for example, the null hypothesis
that all row means are equal (that factor 1 has no effect), we compare

    F = (row mean square) / (residual mean square)

with $F_{.05}[r - 1, (r - 1)(c - 1)]$ from Appendix Table 5, and reject the
null hypothesis if F exceeds the tabled value. To test the null hypothesis
that the column means are equal (that factor 2 has no effect), we compare

    F = (column mean square) / (residual mean square)

with $F_{.05}[c - 1, (r - 1)(c - 1)]$. The significance test for row effects is
"orthogonal to" the test for column effects; i.e., the presence or absence of
column effects does not influence the test for row effects (and vice versa),
but the two test ratios have the same denominator and as a result are not
statistically independent. (See Ref. 5.12, p. 321.)

In the case of a Type II problem (Sec. 5.2.2), the expected-mean-square
column set equal, term by term, to the mean-square column can be used to
obtain

    $s_0^2$ = estimate of the variance $\sigma_0^2$ not caused by either factor by itself
              (Ref. 5.12, pp. 342-44)
    $s_r^2$ = estimate of the variance $\sigma_r^2$ caused by factor 1
    $s_c^2$ = estimate of the variance $\sigma_c^2$ caused by factor 2

Example  (Type  I  Problem).  Table  5.10  shows  the  results  of  an 
experiment  to  determine  the  effects  of  two  factors  (volume  of  sample  and 
operator)  on  the  variable  (percentage  of  nitroglycerin)  in  a  chemical 
determination.  The  experiment  is  designed  to  test  whether  the  chemical 
determination  of  the  percentage  of  nitroglycerin  in  a  solution  depends 
on  the  volume  of  the  sample  drawn  for  analysis. 

The  data  have  been  coded.  Three  levels  of  operator  were  chosen  (three 
specific  operators),  and  each  operator  made  determinations  on  five 
samples  of  different  volumes.  In  Table  5.10  only  the  total,  not  the 
separate  squares,  in  the  next-to-last  row  and  column  need  be  tabulated. 
Table 5.11 gives the analysis of variance.

TABLE 5.10. Two-Way Classification (Without Replication)
of Nitroglycerin Data

In using a machine computing technique, only the total, not the separate
squares, would be tabulated for the next-to-last row and column.

Volume of               Operator             Sum   Squared   Sum of
sample            No. 1   No. 2   No. 3              sum     squares
V1                    4       0       7       11      121        65
V2                    1       2       1        4       16         6
V3                    6       3       8       17      289       109
V4                    7       0       6       13      169        85
V5                    2       9       8       19      361       149
Sum                  20      14      30       64      956
Squared sum         400     196     900    1,496
Sum of squares      106      94     214                         414

Note: c = 3; r = 5; n = 15.


TABLE 5.11. Analysis of Variance for
Nitroglycerin Data

Source of variation   Sum of squares   Degrees of freedom   Mean square
Among volumes                 45.600                    4        11.400
Among operators               26.133                    2        13.067
Residual                      69.200                    8         8.650
Total                        140.933                   14

Computation of the sums of squares:

    Total:            $\frac{15(414) - (64)^2}{15} = \frac{2,114}{15} = 140.933$
    Among volumes:    $\frac{5(956) - (64)^2}{15} = \frac{684}{15} = 45.600$
    Among operators:  $\frac{3(1,496) - (64)^2}{15} = \frac{392}{15} = 26.133$
    Residual:         $\frac{2,114 - 392 - 684}{15} = \frac{1,038}{15} = 69.200$

To test at the 5% significance level the hypothesis that the volume of
the sample used in the determination has no effect on the result, we
compare

    $F = \frac{11.400}{8.650} = 1.318$

with $F_{.05}(4, 8) = 3.84$, from Appendix Table 5. We cannot conclude
that the percentage of nitroglycerin reported depends on the volume of
the sample.

To test the null hypothesis that the column means are equal; i.e., that
the choice of operator has no effect on the percentage of nitroglycerin
found in the sample, we compare

    $F = \frac{13.067}{8.650} = 1.511$

with $F_{.05}(2, 8) = 4.46$ from Appendix Table 5. Since 1.511 does not
exceed 4.46, the data are consistent with the null hypothesis that the
choice of the operator does not affect the percentage of nitroglycerin
detected.
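The sums of squares of Table 5.11 and the two F ratios can be reproduced with a short Python sketch. The function name `two_way_anova` and the dictionary keys are illustrative assumptions, not part of the Manual. The V2 entry for operator No. 3 is taken as 1, the value implied by the row and column totals of Table 5.10. Critical values must still be read from Appendix Table 5.

```python
def two_way_anova(table):
    """Two-factor analysis of variance without replication.

    table[i][j] is the single observation for row level i and column level j.
    """
    r, c = len(table), len(table[0])
    n = r * c
    grand_total = sum(sum(row) for row in table)
    correction = grand_total ** 2 / n  # (sum of all observations)^2 / n

    ss_total = sum(x * x for row in table for x in row) - correction
    ss_rows = sum(sum(row) ** 2 for row in table) / c - correction
    col_totals = [sum(row[j] for row in table) for j in range(c)]
    ss_cols = sum(t * t for t in col_totals) / r - correction
    ss_resid = ss_total - ss_rows - ss_cols  # residual found by subtraction

    ms_rows = ss_rows / (r - 1)
    ms_cols = ss_cols / (c - 1)
    ms_resid = ss_resid / ((r - 1) * (c - 1))
    return {
        "ss_total": ss_total, "ss_rows": ss_rows,
        "ss_cols": ss_cols, "ss_resid": ss_resid,
        "ms_rows": ms_rows, "ms_cols": ms_cols, "ms_resid": ms_resid,
        "f_rows": ms_rows / ms_resid, "f_cols": ms_cols / ms_resid,
    }

# Rows are volumes V1..V5; columns are operators No. 1..No. 3 (coded data).
nitro = [
    [4, 0, 7],
    [1, 2, 1],
    [6, 3, 8],
    [7, 0, 6],
    [2, 9, 8],
]
result = two_way_anova(nitro)
for key, value in result.items():
    print(f"{key:9s} {value:9.3f}")
```

Here the rows (volumes) play the part of factor 1 and the columns (operators) the part of factor 2, so `f_rows` is compared with the tabled F for (4, 8) degrees of freedom and `f_cols` with that for (2, 8).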


Example (Type II Problem). Suppose the three operators in the
example above were chosen at random from a large number of chemists,
and the five volumes were selected at random from lists showing the
volumes drawn for determinations during the past year. In this case we
would be justified in analyzing the components of variance.

Referring to Tables 5.9 and 5.11, we find the estimates

    $c s_r^2 + s_0^2 = 11.400$
    $r s_c^2 + s_0^2 = 13.067$
    $s_0^2 = 8.650$

so that, with c = 3 and r = 5,

    $s_r^2 = 0.917$ and $s_c^2 = 0.883$

The estimate of the total variance of a random observation is

    $0.883 + 0.917 + 8.650 = 10.450$




We can use the F tests of the previous example (Type I problem) to
test whether $\sigma_r^2 = 0$ and whether $\sigma_c^2 = 0$. The variability caused by the
operator and the variability caused by the volume are thus shown to be
not significantly different from zero. Aside from the statistical significance
(in the technical sense of the size of the effect relative to the random
fluctuation), we see by inspection that the standard deviation of a random
determination ($\sqrt{10.450} = 3.23$) is, in a practical sense, negligibly
more than the standard deviation of a determination with operator and
volume effects eliminated ($\sqrt{8.650} = 2.94$). The experimental data indicate
that little increase in the reproducibility of determinations is achieved
by holding either operator or volume fixed; however, the variance estimates
are subject to considerable uncertainty because of the small size
of the experiment.
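The component estimates of this example follow from solving the expected-mean-square equations, as the brief Python sketch below illustrates. The function name `two_way_components` is hypothetical; the inputs are the mean squares of Table 5.11, with rows taken as the r = 5 volumes and columns as the c = 3 operators.

```python
import math

def two_way_components(ms_rows, ms_cols, ms_resid, r, c):
    """Solve the expected-mean-square equations for the variance components.

    Assumes the Type II relations used in the example:
        c * s_r^2 + s_0^2 = row mean square
        r * s_c^2 + s_0^2 = column mean square
        s_0^2             = residual mean square
    """
    s2_resid = ms_resid
    s2_rows = (ms_rows - ms_resid) / c
    s2_cols = (ms_cols - ms_resid) / r
    return s2_rows, s2_cols, s2_resid

s2_rows, s2_cols, s2_resid = two_way_components(11.400, 13.067, 8.650, r=5, c=3)
total_variance = s2_rows + s2_cols + s2_resid
sd_random = math.sqrt(total_variance)  # operator and volume both free to vary
sd_fixed = math.sqrt(s2_resid)         # operator and volume effects eliminated

print(f"s_r^2 = {s2_rows:.3f}, s_c^2 = {s2_cols:.3f}, s_0^2 = {s2_resid:.3f}")
print(f"total variance = {total_variance:.3f}")
print(f"standard deviations: {sd_random:.2f} versus {sd_fixed:.2f}")
```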

5.5.2.  Two-Factor  Analysis  With  Replication 

Only  the  Type  I  problem  is  treated  here.  Suppose  that  in  Table  5.8  the 
set  of  treatments  is  applied  not  once,  but  m  times  under  the  same  conditions. 
Then the cell, or treatment, (i, j) will contain the m observations $x_{ij1}$,
$x_{ij2}, \ldots, x_{ijm}$. (Reference 5.13 treats the case in which the number of
observations varies from cell to cell.) Here we have the mathematical model

    $x_{ijt} = \mu + \alpha_i + \beta_j + \delta_{ij} + e_{ijt}$   $(i = 1, 2, \ldots, r;\ j = 1, 2, \ldots, c;\ t = 1, 2, \ldots, m)$

where

    $\mu$ = over-all mean
    $\alpha_i$ = row main effect
    $\beta_j$ = column main effect
    $\delta_{ij}$ = interaction of factor 1 at level i and factor 2 at level j (a joint
        effect, beyond the total of their individual effects)
    $e_{ijt}$ = random deviation from the mean position of the tth item receiving
        factor 1 at level i and factor 2 at level j. The $e_{ijt}$ have
        independent normal distributions with mean zero and common
        variance $\sigma_0^2$.

In the Type I model, $\sum_i \alpha_i = \sum_j \beta_j = 0$, $\sum_i \delta_{ij} = 0$ for j = 1, 2, ..., c,
and $\sum_j \delta_{ij} = 0$ for i = 1, 2, ..., r. The replication of the treatments
makes it possible to estimate the interaction. The total number of observations
n is rcm. The choice of successive items for each cell and the order of
experimentation on them should be random among all rcm observations.

We  begin  the  computations  by  carrying  out  the  operations  indicated  in 
Tables  5.8  and  5.9  on  the  sums  of  the  observations  in  each  cell,  with 
n = rcm. Let

    $X_{ij} = \sum_{t=1}^{m} x_{ijt}$

The calculations on the X's yield the first four rows of the analysis of
variance shown in Table 5.12. The remaining two rows are then computed,
using the total sum of squares

    $\frac{n \sum_{i,j,t}^{r,c,m} x_{ijt}^2 - \left( \sum_{i,j,t}^{r,c,m} x_{ijt} \right)^2}{n}$

As the first step in significance testing, we test the interaction against
the within-treatments mean square. We compare

    F = (interaction mean square) / (within-treatments mean square)

with $F_\alpha[(r - 1)(c - 1), rc(m - 1)]$ from Appendix Table 5. If $F < F_\alpha$,
we cannot reject the null hypothesis that the interactions are zero. If
$F > F_\alpha$, we reject at the significance level $\alpha$ the null hypothesis that there
is zero interaction, on the grounds that such a large value of F would
occur with probability only $\alpha$ if the null hypothesis held.

Interaction  can  appear  to  be  present  for  the  following  reasons : 

1. The two factors operating together have effects beyond the combination
of their separate effects; that is, a non-zero interaction is actually
present. For instance, this might occur in the example given in Sec. 5.5.1
if one operator, though accurate when using large samples, tended to
underestimate the percentage of nitroglycerin in small samples.

2. Significant interactions will appear with probability $\alpha$ as a result of
random fluctuations alone.

3. Apparent interactions may be one of several unexplained effects
occurring when important variables are left out of the analysis.

TABLE 5.12. Analysis of Variance for Two-Way Classification
(With Replication)

To test for row or column main effects, we form an F ratio with the
row or column mean square in the numerator and a mean square that is
an appropriate estimate of uncontrolled variability in the denominator. In
an experiment designed with proper randomization, such an estimate is
provided by the within-treatments mean square (Table 5.12). Unfortunately,
however, experiments may be performed without adequate randomization
of the order in which treatments are applied, so that background
conditions may remain nearly constant within each treatment, but vary
from treatment to treatment. As a result, the uncontrolled variation in
background conditions tends to enlarge the mean squares for interaction
and main effects, but not the within-treatments mean square. The appropriate
estimate of uncontrolled variability is, in this case, the interaction
mean square. (For further discussion related to the somewhat controversial
question of which denominator to use, see Ref. 5.5, pp. 136-39;
Ref. 5.12, pp. 337-49; and Ref. 5.15, pp. 119-23.)

When  significant  effects,  especially  with  interactions,  are  found  in  an 
analysis  of  variance,  it  is  helpful  to  present  them  graphically.  (For  a  sample 
graph,  see  Fig.  5.1  in  Sec.  5.6.)  If  interactions  are  significant,  further 
analysis  may  be  desirable  (Ref.  5.4,  pp.  139-46;  Ref.  5.1,  pp.  270-72; 
and  Ref.  5.9,  pp.  456-79). 

Example. In the course of deciding on tactical uses for two rockets,
an operations analyst requests the experiment reported in Table 5.13.
Three types of planes are used five times with each of two types of
rockets. The entries in Table 5.13 represent coded evaluations of the
target destruction obtained in the tests. The experiment is designed to
test whether the target destruction depends on the type of rocket or the
type of plane, or on the interaction of the two factors, making use of
significance tests at the 5% level.

To analyze the data, we replace the five values in each cell by the sum
of those values, and carry out the preliminary calculations shown in
Table 5.14. (In this table only the sums have been entered in the next-
to-last row and column.) Table 5.15 shows the analysis of variance.

TABLE 5.13. Two-Way Classification (With
Replication) of Rocket-Comparison Data

Rocket type        Plane type
                 I     II    III
A                0      1      6
                 2      3      5
                 3      3      7
                 6      5      6
                 5      6      6
B                3      2      6
                 5      3      7
                 7      4      5
                 2      3      8
                 5      6      8

TABLE 5.14. Calculations on Rocket-Comparison Data

Rocket type          Plane type           Sum   Squared sum   Sum of squares
                   I      II     III
A                 16      18      30       64                      1,480
B                 22      18      34       74                      1,964
Sum               38      36      64      138        9,572
Squared sum                             6,836
Sum of squares   740     648   2,056                               3,444

Test for interaction

    $F = \frac{0.933}{2.967} = 0.314$

Since this F value is less than 1 and therefore certainly less than
$F_{.05}(2, 24)$, the interaction is not significant. Incidentally, F values less
than 1 occur with a substantial frequency in such computations if the null
hypothesis is true.

Test for rocket effects

    $F = \frac{3.333}{2.967} = 1.12$ and $F_{.05}(1, 24) = 4.26$

We conclude that the difference among rockets is not significant.

Test for plane effects

    $F = \frac{24.400}{2.967} = 8.22$ and $F_{.05}(2, 24) = 3.40$

We conclude that the difference among planes is significant.
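The calculations for this replicated example can be sketched in Python as follows. The function name `two_way_replicated` and its dictionary keys are assumptions made for illustration. The fourth observation for rocket A with plane type I is taken as 6, the value implied by the cell sum of 16 in Table 5.14. Because the sketch divides unrounded mean squares, its F ratios may differ in the last digit from those in the text, which divides rounded values.

```python
def two_way_replicated(cells):
    """Two-factor analysis of variance with m observations per cell.

    cells[i][j] is the list of observations for row level i, column level j.
    """
    r, c, m = len(cells), len(cells[0]), len(cells[0][0])
    n = r * c * m
    sums = [[sum(cell) for cell in row] for row in cells]  # cell totals X_ij
    grand_total = sum(sum(row) for row in sums)
    correction = grand_total ** 2 / n

    ss_total = sum(x * x for row in cells for cell in row for x in cell) - correction
    ss_subtotal = sum(s * s for row in sums for s in row) / m - correction
    ss_rows = sum(sum(row) ** 2 for row in sums) / (c * m) - correction
    col_totals = [sum(row[j] for row in sums) for j in range(c)]
    ss_cols = sum(t * t for t in col_totals) / (r * m) - correction
    ss_inter = ss_subtotal - ss_rows - ss_cols
    ss_within = ss_total - ss_subtotal

    ms_within = ss_within / (r * c * (m - 1))
    return {
        "ss_rows": ss_rows, "ss_cols": ss_cols, "ss_inter": ss_inter,
        "ss_subtotal": ss_subtotal, "ss_within": ss_within, "ss_total": ss_total,
        "f_inter": ss_inter / ((r - 1) * (c - 1)) / ms_within,
        "f_rows": ss_rows / (r - 1) / ms_within,
        "f_cols": ss_cols / (c - 1) / ms_within,
    }

# Rows are rockets A and B; columns are plane types I, II, III (coded data).
rockets = [
    [[0, 2, 3, 6, 5], [1, 3, 3, 5, 6], [6, 5, 7, 6, 6]],
    [[3, 5, 7, 2, 5], [2, 3, 4, 3, 6], [6, 7, 5, 8, 8]],
]
res = two_way_replicated(rockets)
for key, value in res.items():
    print(f"{key:12s} {value:9.3f}")
```

The within-treatments mean square is used as the denominator throughout, which corresponds to the properly randomized case discussed above.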


TABLE 5.15. Analysis of Variance for
Rocket-Comparison Data

Source of variability   Sum of squares   Degrees of freedom   Mean square
Among rockets                    3.333                    1         3.333
Among planes                    48.800                    2        24.400
Interaction                      1.867                    2         0.933
Subtotal                        54.000                    5
Within treatments               71.200                   24         2.967
Total                          125.200                   29

Computation of the sums of squares:

    Subtotal:           $\frac{6(3,444) - (138)^2}{30} = \frac{1,620}{30} = 54.000$
    Among rockets:      $\frac{2(9,572) - (138)^2}{30} = \frac{100}{30} = 3.333$
    Among planes:       $\frac{3(6,836) - (138)^2}{30} = \frac{1,464}{30} = 48.800$
    Interaction:        $\frac{1,620 - 1,464 - 100}{30} = \frac{56}{30} = 1.867$
    Total:              $\frac{30(0^2 + 2^2 + 3^2 + 6^2 + 5^2 + 3^2 + 5^2 + \cdots + 8^2) - (138)^2}{30} = 125.200$
    Within treatments:  $125.200 - 54.000 = 71.200$

5.6. Multiple-Factor Analysis

The methods described above can be extended to cover any number k
of factors. We treat such data in k individual (k - 1)-factor tables obtained
by summing observations over each factor in turn. After the (k - 1)-factor
tables have been calculated separately, the residual for an experiment without
replication is found by subtracting all sums of squares in the (k - 1)-factor
tables from the total sum of squares of deviations of individual
observations from the over-all mean. The complexity and number of
interactions increase rapidly, of course, but the method is general.

The  following  example  shows  the  analysis  of  an  experiment  involving 
three  factors. 

Example. The observed variable (coded) is the outside diameter
of a propellant grain. The purpose of the experiment is to study the
effects on outside diameter of intentionally varying extrusion conditions.
The factors are extrusion rate (factor A, three levels), die temperature
(factor B, three levels), and charge temperature (factor C, two levels).
The upper number of each pair of entries in Table 5.16 corresponds
to charge temperature $C_1$; the lower, to charge temperature $C_2$.

TABLE 5.16. Three-Way Classification (Without
Replication) of Outside-Diameter Data

Die temp. B        Extrusion rate A
                 A1      A2      A3
B1               78      66      56
                 34      23       5
B2               85      84      75
                 70      52      54
B3               92      93      96
                 79      70      64

First we compute 3 two-factor analyses on the sums of observations
over each factor individually. Each of these analyses is obtained by
considering the levels of one factor as replications and proceeding as
in Sec. 5.5.2. In Table 5.17, for instance, the data for extrusion-rate
versus die-temperature analysis were obtained by summing over the two
levels of charge temperature for each entry. Table 5.18 shows the
analysis of variance for this case.

Tables  5.18,  5.20,  and  5.22  can  be  summarized,  and  the  three-way 
analysis  completed  as  in  Table  5.23.  The  sums  of  squares  are  carried  to 
several  decimal  places  to  provide  sufficient  significant  figures  in  the 
relatively  small  residual  sum  of  squares  obtained  by  subtraction.  An 
experienced  variance  analyst  can  omit  intermediate  Tables  5.17-5.22 
by  adjoining  simple  sums  (but  not  sums  of  squares)  to  Table  5.16  and 
entering  the  final  sums  of  squares  directly  into  Table  5.23. 

The F ratios for testing significance of the main effects and two-factor
interactions are formed by dividing the appropriate mean squares by the
residual mean square. Thus we have

F ratio                                      Critical value             Significance
$F_A = 324.667/21.806 = 14.89$               $F_{.05}(2, 4) = 6.94$     significant
$F_B = 2,340.667/21.806 = 107.34$            $F_{.05}(2, 4) = 6.94$     significant
$F_C = 4,170.889/21.806 = 191.27$            $F_{.05}(1, 4) = 7.71$     significant
$F_{AB} = 54.083/21.806 = 2.48$              $F_{.05}(4, 4) = 6.39$     not significant
$F_{AC} = 48.222/21.806 = 2.21$              $F_{.05}(2, 4) = 6.94$     not significant
$F_{BC} = 272.222/21.806 = 12.48$            $F_{.05}(2, 4) = 6.94$     significant

All three factors thus have real main effects, but the effects of die
temperature and charge temperature are not independent. Just what
these effects are should be evaluated by referring to the original data in
Table 5.16. The effects may then be presented graphically, as in Fig.
5.1. If interactions are significant, further analysis may be desirable
(Ref. 5.4, pp. 139-46 and Ref. 5.1, pp. 270-72).

TABLE 5.17. Two-Way Table Derived From
Outside-Diameter Data

Die temp. B         Extrusion rate A            Sum   Squared sum   Sum of squares
                   A1       A2       A3
B1                112       89       61         262                      24,186
B2                155      136      129         420                      59,162
B3                171      163      160         494                      81,410
Sum               438      388      350       1,176      489,080
Squared sum                                 464,888
Sum of squares 65,810   52,986   45,962                                 164,758

TABLE 5.18. Analysis of Variance for
Two-Way Table 5.17

Source of variation      Sum of squares   Degrees of freedom   Mean square
Extrusion rate A                649.333                    2       324.667
Die temperature B             4,681.333                    2     2,340.667
Interaction A × B               216.333                    4        54.083
Subtotal A × B                5,547.000                    8

Computation of the sums of squares:

    Subtotal A × B:      $\frac{9(164,758) - (1,176)^2}{18} = \frac{99,846}{18} = 5,547.000$
    Extrusion rate A:    $\frac{3(464,888) - (1,176)^2}{18} = \frac{11,688}{18} = 649.333$
    Die temperature B:   $\frac{3(489,080) - (1,176)^2}{18} = \frac{84,264}{18} = 4,681.333$
    Interaction A × B:   $\frac{99,846 - 11,688 - 84,264}{18} = \frac{3,894}{18} = 216.333$

TABLE 5.19. Two-Way Table Derived From
Outside-Diameter Data

Charge temp. C      Extrusion rate A            Sum   Squared sum   Sum of squares
                   A1       A2       A3
C1                255      243      227         725                     175,603
C2                183      145      123         451                      69,643
Sum               438      388      350       1,176      729,026
Squared sum                                 464,888
Sum of squares 98,514   80,074   66,658                                 245,246

TABLE 5.20. Analysis of Variance for
Two-Way Table 5.19

Source of variation      Sum of squares   Degrees of freedom   Mean square
Extrusion rate A                649.333                    2       324.667
Charge temperature C          4,170.889                    1     4,170.889
Interaction A × C                96.444                    2        48.222
Subtotal A × C                4,916.667                    5

Computation of the sums of squares:

    Subtotal A × C:        $\frac{6(245,246) - (1,176)^2}{18} = \frac{88,500}{18} = 4,916.667$
    Extrusion rate A:      $\frac{3(464,888) - (1,176)^2}{18} = \frac{11,688}{18} = 649.333$
    Charge temperature C:  $\frac{2(729,026) - (1,176)^2}{18} = \frac{75,076}{18} = 4,170.889$
    Interaction A × C:     $\frac{88,500 - 11,688 - 75,076}{18} = \frac{1,736}{18} = 96.444$

TABLE 5.21. Two-Way Table Derived From
Outside-Diameter Data

Charge temp. C      Die temperature B           Sum   Squared sum   Sum of squares
                   B1       B2       B3
C1                200      244      281         725                     178,497
C2                 62      176      213         451                      80,189
Sum               262      420      494       1,176      729,026
Squared sum                                 489,080
Sum of squares 43,844   90,512  124,330                                 258,686

TABLE 5.22. Analysis of Variance for
Two-Way Table 5.21

Source of variation      Sum of squares   Degrees of freedom   Mean square
Die temperature B             4,681.333                    2     2,340.667
Charge temperature C          4,170.889                    1     4,170.889
Interaction B × C               544.444                    2       272.222
Subtotal B × C                9,396.667                    5

Computation of the sums of squares:

    Subtotal B × C:        $\frac{6(258,686) - (1,176)^2}{18} = \frac{169,140}{18} = 9,396.667$
    Die temperature B:     $\frac{3(489,080) - (1,176)^2}{18} = \frac{84,264}{18} = 4,681.333$
    Charge temperature C:  $\frac{2(729,026) - (1,176)^2}{18} = \frac{75,076}{18} = 4,170.889$
    Interaction B × C:     $\frac{169,140 - 84,264 - 75,076}{18} = \frac{9,800}{18} = 544.444$

TABLE 5.23. Three-Way Analysis of Variance
for Outside-Diameter Data

Source of variation      Sum of squares   Degrees of freedom   Mean square
Extrusion rate A                649.333                    2       324.667
Die temperature B             4,681.333                    2     2,340.667
Charge temperature C          4,170.889                    1     4,170.889
Interaction A × B               216.333                    4        54.083
Interaction A × C                96.444                    2        48.222
Interaction B × C               544.444                    2       272.222
Residual                         87.224                    4        21.806
Total                        10,446.000                   17

Computation of the sums of squares:

    Total:     $\frac{18(78^2 + 34^2 + 85^2 + 70^2 + 92^2 + 79^2 + 66^2 + 23^2 + \cdots + 64^2) - (1,176)^2}{18} = \frac{188,028}{18} = 10,446.000$
    Residual:  $10,446.000 - (649.333 + 4,681.333 + 4,170.889 + 216.333 + 96.444 + 544.444) = 87.224$

If several observations had been taken for each of the 18 treatments,
we would have had a within-treatments mean square and could have
tested the significance of the three-factor interaction A × B × C also.
Our failure to replicate must be based on an assumption that the three-factor
interaction is negligible compared with the uncontrolled variability.
The analysis of variance for the three-factor experiment with
replication is obtained from the analysis without replication by the same
extension used for two factors (Sec. 5.5.2); with proper randomization
of the order in which observations are made, the within-treatments
mean square is the appropriate denominator for all the F ratios. If, on
the other hand, the experiment were performed with replication but
without adequate randomization, then for testing main effects and two-factor
interactions, the appropriate denominator would be the three-factor
interaction mean square. (See the similar discussion in Sec. 5.5.2,
where references are given.)

The analysis of the corresponding Type II problem has a similar general
form but differs in the formation of the F ratios; see the references
in Sec. 5.5.2.
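The three-factor analysis without replication can also be carried out directly from the data of Table 5.16, as in the hedged Python sketch below. The function name `three_way_anova` and the helper `ss_margin` are hypothetical. The lower entry for die temperature B3 at extrusion rate A2 is taken as 70, the value implied by the totals of Tables 5.17 and 5.21. The residual computed here from unrounded sums of squares agrees with Table 5.23 to about two decimal places.

```python
from itertools import product

def three_way_anova(x):
    """Three-factor analysis of variance without replication.

    x[i][j][k] is the observation at level i of A, j of B, and k of C.
    """
    a, b, c = len(x), len(x[0]), len(x[0][0])
    n = a * b * c
    cells = list(product(range(a), range(b), range(c)))
    grand_total = sum(x[i][j][k] for i, j, k in cells)
    correction = grand_total ** 2 / n

    def ss_margin(keep):
        """Sum of squares among totals formed by summing over factors not in keep."""
        totals = {}
        for cell in cells:
            key = tuple(cell[f] for f in keep)
            totals[key] = totals.get(key, 0) + x[cell[0]][cell[1]][cell[2]]
        per_total = n // len(totals)  # observations contributing to each total
        return sum(t * t for t in totals.values()) / per_total - correction

    ss = {"A": ss_margin((0,)), "B": ss_margin((1,)), "C": ss_margin((2,))}
    ss["AB"] = ss_margin((0, 1)) - ss["A"] - ss["B"]
    ss["AC"] = ss_margin((0, 2)) - ss["A"] - ss["C"]
    ss["BC"] = ss_margin((1, 2)) - ss["B"] - ss["C"]
    ss_total = sum(x[i][j][k] ** 2 for i, j, k in cells) - correction
    ss["Residual"] = ss_total - sum(ss.values())  # residual by subtraction

    df = {"A": a - 1, "B": b - 1, "C": c - 1,
          "AB": (a - 1) * (b - 1), "AC": (a - 1) * (c - 1),
          "BC": (b - 1) * (c - 1), "Residual": (a - 1) * (b - 1) * (c - 1)}
    ms = {name: ss[name] / df[name] for name in ss}
    f = {name: ms[name] / ms["Residual"] for name in ms if name != "Residual"}
    return ss, df, ms, f, ss_total

# diameter[i][j] = (value at C1, value at C2) for extrusion rate Ai, die temp. Bj.
diameter = [
    [(78, 34), (85, 70), (92, 79)],
    [(66, 23), (84, 52), (93, 70)],
    [(56, 5), (75, 54), (96, 64)],
]
ss, df, ms, f, ss_total = three_way_anova(diameter)
for name in ss:
    ratio = f"{f[name]:8.2f}" if name in f else ""
    print(f"{name:9s} {ss[name]:11.3f} {df[name]:3d} {ms[name]:10.3f} {ratio}")
```

Each F ratio is then compared with the tabled value for its own degrees of freedom and the 4 residual degrees of freedom, as in the example.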
FIG. 5.1. Graphs of the Derived Two-Way Data of Tables 5.19 and 5.21, Showing ... charge temperature for three levels of extrusion rate A. The interaction A × C is not ... [panels (a) and (b)]

Chapter 6

FITTING  A  FUNCTION  OF  ONE  OR  MORE  VARIABLES 


6.0.  Introduction 

Often we want to predict the value of a variable y for any given values of
one or more variables $x_i$. Sometimes a physical law connects the variables
so that y may be expressed as a function of the x's. For instance, the formula
$y = \frac{1}{2} g x^2$ enables us to predict the distance y traversed by a falling body
in time x. However, the x and y values may follow a regular law only
ideally,  so  that  if  they  are  plotted  as  points  on  an  xy  plane,  they  lie  scattered 
about  the  curve  that  represents  the  ideal  formula.  Experimenters  find  such 
a  "scatter  diagram"  valuable  in  exploratory  work  for  relationships  among 
variables.  (For  an  example  of  a  scatter  diagram  see  Fig.  6.3  in  Sec.  6.1.1.) 

From  past  theoretical  or  experimental  work  it  may  be  hypothesized  that 
the  relation  among  several  variables  is  of  a  given  form  (e.g.,  a  second- 
degree  polynomial)  without  necessarily  specifying  the  numerical  values  of 
all  the  constants  in  the  equation.  Regression  analysis  provides  a  systematic 
technique  for  estimating,  with  confidence  limits,  the  unspecified  constants 
from  a  new  set  of  data,  or  for  testing  whether  the  new  data  are  consistent 
with  the  hypothesis. 

An  important  special  case  of  a  function  of  x’s  for  predicting  y  is  the 
linear  function.  Section  6.1  treats  y  as  a  linear  function  of  one  other 
variable  x;  Sec.  6.2  treats  y  as  a  linear  function  of  several  x’s;  Sec.  6.1.6 
gives  a  test  for  linearity;  Sec.  6.3  discusses  transformations  of  non-linear 
data  to  reduce  the  problem  to  the  linear  case  and,  in  particular,  treats  y  as 
a  polynomial  function  of  x;  and  Sec.  6.4  considers  the  important  statistical 
question  of  planning  the  experiment  for  fitting  a  function. 

By means of analysis of variance (Chapter 5), we may determine whether
certain chosen values of each of several variables $x_1, x_2, \ldots, x_k$ differ
in their effects on a dependent variable y; the $x_i$ need not be continuous
variables, and their effects need not have any given functional form. By
regression analysis we may determine the functional effect of continuous
variables $x_1, x_2, \ldots, x_k$. Regression analysis can be considered a special
case of variance analysis, but it is treated separately because of its special
techniques  and  importance.  A  third  method,  the  analysis  of  covariance,  is 
especially  important  when  some  condition  of  the  experiment  cannot  be  held 
constant.  By  covariance  analysis  we  can  describe  and  (if  desired)  remove 
the  effect  of  this  changing  condition,  and  test  the  effect  of  the  other 
variables  on  the  remaining  variation  in  y.  For  analysis  of  covariance,  see 
Ref.  6.4,  Chap.  12;  Ref.  6.13,  Chap.  12;  and  Ref.  5.15,  pp.  173-81. 

A statistically significant regression relation of y on x (and similarly for
more than one independent variable) is no indication that the "independent"
variable x causes the observed change in the "dependent" variable y, as
both may be caused to vary by a neglected third variable. For example, if
the order is not random during an experiment (as when the successive x
values  are  chosen  in  decreasing  order),  a  change  in  y  may  reflect  not  the 
influence  of  x,  but  that  of  some  hidden  variable,  such  as  temperature 
warm-up  or  tool  wear,  that  changes  steadily  during  the  course  of  the 
experiment.  If  the  identity  of  the  basic  causative  variables  is  known  from 
scientific  or  engineering  knowledge  of  the  phenomenon,  regression  analysis 
of  appropriate  data  serves  to  estimate  the  constants  of  the  relation  (see  Ref. 
6.7,  pp.  522-26,  613-14,  and  634). 

Though a regression equation is not necessarily built on a causal relationship,
it does provide a valid prediction of the y value for each possible set
of  x  values  to  within  assessable  random  fluctuation.  However,  the  prediction 
is  no  longer  valid  if  the  situation  has  changed  since  the  data  were  taken. 

The  true  or  population  regression  curve  of  y  on  x  gives  for  each  x 
the  mean  of  the  corresponding  y  distribution.  The  equation  or  curve 
obtained  as  a  result  of  data  analysis  is  an  approximation  to  the  true 
regression  curve  and  may  be  used  to  predict  y  values.  In  the  linear  case, 
the curve is called the "regression line." When there are several factors
$x_1, x_2, \ldots, x_k$, the preceding statements hold if the words "surface" and
"plane" are substituted for "curve" and "line."

Regression  analysis  may  be  applied  to  data  gathered  in  two  ways 
(Fig.  6.1): 

1. The amount of the effect y may be measured for certain values of the
factor x chosen at will by the experimenter (Type I problem).

2. The points (x, y) may be chosen at random from a two-dimensional
normal distribution of points in the xy plane (Type II problem).

Thus,  to  investigate  the  effect  of  propellant-grain  weight  on  rocket 
range,  we  may  (1)  load  the  different  rounds  with  grains  having  particular 
weights  of  interest  and  observe  the  range  for  each  one  (Type  I  problem); 
or  (2)  measure  the  grain  weight  and  the  range  for  each  of  several  rounds 
drawn  at  random  from  the  population  of  rounds  (Type  II  problem). 

In either case, the y value observed is considered to be a random observation
from a normal population of possible y's for each particular x used.
These  y  populations  are  assumed  to  have  a  common  standard  deviation.  The 
assumptions  of  normality  and  common  variance  are  not  necessary  for  simply 
fitting  an  equation  from  data,  but  are  necessary  for  significance  tests  and 
confidence  intervals  given  here. 

The  joint  two-dimensional  normal  distribution  from  which  the  point 
(x,  y)  is  drawn  at  random  in  Type  II  problems  is  illustrated  in  Fig.  6.2.  Not 
only  is  each  of  x  and  y  normally  distributed  by  itself  irrespective  of  the 
value  of  the  other  variable,  but  for  each  fixed  x  it  can  be  shown  that 
the  distribution  of  y  is  normal,  with  its  mean  a  linear  function  of  x,  and 
that  all  the  y  distributions  have  a  common  standard  deviation.  By  the 
previously given definition, the regression curve of y on x is a straight line
in this case, as shown in Fig. 6.1b and 6.2. In Type I problems the regression
curve need not be straight; if it is, the Type I and Type II problems
have  so  much  in  common  that  almost  all  the  analysis  is  the  same,  and 
often  the  types  have  not  been  distinguished. 

When  a  regression  function  has  been  derived  from  the  data  to  represent 
the  relationship  between  the  effect  and  the  given  factor,  we  may  test  the 
reliability  of  that  function,  which  is  subject  to  the  sampling  error  of 
the  original  data.  It  is  in  the  investigation  of  reliability  that  the  analyses  for 
Type  I  and  Type  II  regressions  differ  slightly. 

As shown in Fig. 6.1, we have a whole population of y’s for each x. All
these populations have a common standard deviation, denoted by $\sigma_{y|x}$. If
$\sigma_{y|x}$ is large, the regression line (or more generally, the regression surface),
which predicts only the mean of the population of y’s for a given x, will
be of little use. Indeed, if the number of observations (x, y) is small, not
even the mean will be estimated very precisely by the fitted regression
equation.


FIG.  6.2.  Two-Dimensional  Normal  Distribution. 


In  a  regression  analysis  the  assumption  is  made  that  the  mean  of  y  is  a 
linear  function  of  the  x’s  (more  generally,  a  function  of  given  form).  If 
the  resulting  equation  is  used  for  predicting  beyond  the  range  of  x’s  for 
which  this  assumption  holds  true,  errors  not  accounted  for  in  the  confidence 
intervals  for  the  regression  will  occur.  For  this  reason,  extrapolation  is 
inadvisable. 

The general method used in estimating a population regression curve
from sample data is the method of least squares. The sample regression
curve of y on x of given degree is the curve among all those of that degree
that minimizes the sum of squares of vertical (y) deviations of the observed
points from the curve. The method of least squares has general application
in curve fitting (Sec. 6.3).



Not  all  the  possible  types  of  problems  in  fitting  equations  to  data  can 
be  treated  rigorously  by  the  methods  of  this  Manual;  that  is,  as  Type  I  or 
Type  II  problems.  Models  that  are  of  neither  type  arise  whenever  the 
observations  on  x  include  measurement  error  in  addition  to  any  inherent 
variability.  (The  variation  of  y  in  Type  I  and  Type  II  problems  may  result 
from  measurement  error  or  inherent  variability,  or  both.)  A  statistician 
should  be  consulted  if  a  problem  does  not  appear  to  fit  the  models  provided 
here;  however,  rigorous  solutions  to  some  of  the  more  complicated  problems 
are  not  known. 

6.1.  Linear  Regression  (Two  Variables) 

It is assumed that there are only two variables of interest, an "independent"
variable x, and a "dependent" variable y, and that a sample
$(x_1, y_1;\ x_2, y_2;\ \ldots;\ x_n, y_n)$ is drawn. The curve of the y mean values for the
population is assumed to be a straight line. A test of this assumption is
given in Sec. 6.1.6. If the curve is not straight, it may be known, or perhaps
can be assumed, to be of a form that can be treated as explained in Sec. 6.3.

6.1.1.  Regression  Line 

The equation of the sample regression line of y on x is

$$ y' = a + bx $$

where the regression coefficient, the slope of the regression line, is

$$ b = \frac{\sum (x - \bar{x})(y - \bar{y})}{\sum (x - \bar{x})^2} = \frac{n \sum xy - \sum x \sum y}{n \sum x^2 - (\sum x)^2} $$

and the y intercept is

$$ a = \bar{y} - b\bar{x} = \frac{\sum y - b \sum x}{n} $$

(See Sec. 6.1.6 for formulas if there are several values of y for each value
of x.) The general equation may also be written as

$$ y' - \bar{y} = b(x - \bar{x}) $$

which shows that the line passes through the center of gravity of the
observed points. The prime (′) distinguishes a predicted or calculated value
from an observed value.

TABLE 6.1. Data for Linear-Regression Example

Propellant    Nitrocellulose, %    Young’s modulus at 70°F,
   lot               x             in thousands of psi
                                            y
    1              55.77                  83.9
    2              55.05                  66.4
    3              54.27                  73.1
    4              50.63                  66.7
    5              49.86                  30.1
    6              53.04                  36.2
    7              51.33                  22.8
    8              56.70                  66.5
    9              55.07                  37.0
   10              55.76                  58.0
   11              54.40                  71.9
   12              55.39                  83.1
   13              57.49                  66.2
   14              57.56                  72.3
   15              58.76                  65.8
   16              59.32                 123.5
   17              57.21                 116.8
   18              68.55                 160.1
   19              65.04                 158.2
   20              66.98                 152.2
   21              63.69                 134.8
   22              58.34                  87.3

The sample regression line given here is an estimate of the true or
population regression line of y on x. The sample regression line is such
that the sum of squares of vertical deviations of the observed points from
this line is smaller than the corresponding sum for any other straight line;
i.e., the sample regression line is a least-squares estimate of the unknown
true line.

In a Type I problem, although the observations are made by observing
y at values of x chosen by the experimenter, it may be desired to predict
what value of x would yield a specified value of y on the average; this may
be done by solving for x in the regression equation of y on x given above.
In a Type II problem, on the other hand, when we know y and wish to
predict x, the regression line of x on y is appropriate and would be
determined mathematically by minimizing the sum of squared horizontal
deviations. This line is different from the regression line of y on x (unless all
points lie exactly on one line), since the latter minimizes the sum of squared
vertical deviations. The formulas are completely analogous, and can be
obtained by interchanging x and y in the formulas given above. More
simply, the predicted variable can always be labeled y, as in this Manual.

For some purposes, a satisfactory line can be fitted to the scatter diagram
by eye, without computations.

Example. Fit a regression line of y on x to the data (Type II) of
Table 6.1. This table gives 22 paired values of % nitrocellulose and
Young’s modulus in thousands of psi.

$$ \sum x = 1,260.21 \qquad \sum x^2 = 72,706.0739 $$
$$ \sum y = 1,832.9 \qquad \sum y^2 = 187,657.71 $$
$$ n = 22 \qquad \sum xy = 108,805.243 $$

A partial check of the sums of squares and cross-products is given by
the relation

$$ \sum (x - y)^2 = \sum x^2 - 2 \sum xy + \sum y^2 $$

where the two sides of the equation are computed independently.

$$ 42,753.2979 = 72,706.0739 - 2(108,805.243) + 187,657.71 = 42,753.2979 $$

The regression coefficients are

$$ b = \frac{83,876.437}{11,404.3817} = 7.35476 $$

$$ a = \frac{1,832.9 - (7.35476)(1,260.21)}{22} = -337.983 $$

Therefore, the equation of the sample regression line of y on x is

$$ y' = -337.98 + 7.3548x $$

This line is plotted as the central solid line on the scatter diagram of
Fig. 6.3. (Other curves shown in this figure will be described in Sec.
6.1.2 and 6.1.4e.) To check the values for a and b, substitute them
in the equation

$$ \sum xy = a \sum x + b \sum x^2 $$

which should hold, except for rounding error.

$$ 108,805.243 = (-337.983)(1,260.21) + (7.35476)(72,706.0739) \cong 108,806.168 $$

FIG. 6.3. Graph for Table 6.1, Showing Regression Line, $\pm s_{y|x}$ and $\pm 2s_{y|x}$ Limits
for Individual Observation (Sec. 6.1.2), and 95% Confidence Limits for Regression
Ordinate (Sec. 6.1.4e). (Horizontal axis: percentage of nitrocellulose.)

To illustrate the use of the regression line in predicting values of y,
we compute $y'$, given x = 55.77%.

$$ y' = -337.98 + (7.3548)(55.77) = 72.20 \times 10^3 \text{ psi} $$

The observed value of y is $83.9 \times 10^3$ psi.
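
As an illustration that is not part of the original Manual, the least-squares formulas of Sec. 6.1.1 can be checked numerically against the data of Table 6.1. The sketch below is in Python, and the helper name `fit_line` is an assumption introduced only for this example.

```python
# Table 6.1: x = % nitrocellulose, y = Young's modulus (thousands of psi)
X = [55.77, 55.05, 54.27, 50.63, 49.86, 53.04, 51.33, 56.70, 55.07, 55.76, 54.40,
     55.39, 57.49, 57.56, 58.76, 59.32, 57.21, 68.55, 65.04, 66.98, 63.69, 58.34]
Y = [83.9, 66.4, 73.1, 66.7, 30.1, 36.2, 22.8, 66.5, 37.0, 58.0, 71.9,
     83.1, 66.2, 72.3, 65.8, 123.5, 116.8, 160.1, 158.2, 152.2, 134.8, 87.3]


def fit_line(xs, ys):
    """Return (a, b) of the least-squares line y' = a + b*x (hypothetical helper)."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xx = sum(x * x for x in xs)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    # Slope and intercept from the computing forms of Sec. 6.1.1
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_xx - sum_x ** 2)
    a = (sum_y - b * sum_x) / n
    return a, b


a, b = fit_line(X, Y)
y_pred = a + b * 55.77  # predicted modulus for lot 1
print(f"a = {a:.3f}, b = {b:.5f}, y'(55.77) = {y_pred:.2f}")
```

Up to rounding, this should reproduce the slope and intercept of the worked example; the predicted value differs slightly in the second decimal because the example uses rounded coefficients.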


6.1.2. Variation About the Regression Line

The scatter, in the vertical (y) direction, of the observed points about
the regression line is measured by $s_{y|x}$, where

$$ s_{y|x}^2 = \frac{\sum (y_i - y'_i)^2}{n - 2} = \frac{n - 1}{n - 2}\,(s_y^2 - b^2 s_x^2) = \frac{n - 1}{n - 2}\, s_y^2 (1 - r^2) $$

Either of the latter two forms is to be used for computing. (Section 3.1
shows how to compute $s_x^2$ and $s_y^2$; r, the sample correlation coefficient, is
defined in Sec. 6.1.3.) The quantity $s_{y|x}^2$ estimates that part of the variance
of y left unexplained by the regression of y on x. It is defined with n − 2
rather than n − 1 in the denominator (Sec. 1.1.12) because two degrees of
freedom are absorbed by the estimates a and b. The positive square root,
$s_{y|x}$, is sometimes called the standard error of estimate.

The sum of squares of deviations of the $y_i$ from their over-all mean $\bar{y}$
can be separated into two parts

$$ \sum (y_i - \bar{y})^2 = \sum (y_i - y'_i)^2 + \sum (y'_i - \bar{y})^2 $$

That is, the total sum of squares equals the sum of squares of the
deviations from the regression line plus the sum of squares of the deviations
of the regression values from $\bar{y}$. This separation into two parts is similar
to that described in the analysis of variance in Chapter 5. If we call
$s_{y'}^2 = \sum (y'_i - \bar{y})^2/(n - 1)$ the variance accounted for by the regression,
then (neglecting only the slight difference between n − 1 and n − 2) we
can write the total variance $s_y^2$, like the total sum of squares, as the sum
of two parts

$$ s_y^2 = s_{y|x}^2 + s_{y'}^2 $$

In a Type II model, a similar relation holds exactly for the corresponding
population variances

$$ \sigma_y^2 = \sigma_{y|x}^2 + \sigma_{y'}^2 $$

This separation into parts is discussed further in Sec. 6.1.3.

If it is assumed that the y populations for the various x’s have normal
distributions about the true regression line with common standard
deviation $\sigma_{y|x}$, then 68% of the combined populations lie within $\pm\sigma_{y|x}$ of the
true regression line and 95% within $\pm 2\sigma_{y|x}$. With a large sample we expect
to approximate this situation, taking the sample regression line and $s_{y|x}$
as good approximations to the true regression line and $\sigma_{y|x}$, respectively.
For a more precise description of the distribution of the population about
the regression line, see Sec. 6.1.4f; see also Ref. 6.14 on tolerance intervals
for the regression line. (Tolerance intervals are defined in Sec. 1.1.11.)

Example. For the data of Table 6.1, we calculate

$$ s_x^2 = 24.6848 \qquad s_y^2 = 1,664.3879 $$

$$ s_{y|x}^2 = \frac{21}{20}\,(s_y^2 - b^2 s_x^2) = 1.05\,(329.126) = 345.582 $$

$$ s_{y|x} = 18.59 \times 10^3 \text{ psi} \quad \text{and} \quad s_y = 40.80 \times 10^3 \text{ psi} $$

Lines are drawn on Fig. 6.3 at vertical distances of $\pm s_{y|x}$ and $\pm 2s_{y|x}$
units from the regression line. We see that 16 of the points (73%) lie
within $\pm s_{y|x}$ of the regression line, and that all 22 points (100%)
lie within $\pm 2s_{y|x}$.
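
The following sketch, which is an addition and not part of the Manual, computes the residual variance about the regression line in two ways (directly from the residuals and from the computing form with the sample variances) and counts the points falling within one and two standard errors of estimate. The variable names are assumptions chosen for this illustration.

```python
import math

# Table 6.1: x = % nitrocellulose, y = Young's modulus (thousands of psi)
X = [55.77, 55.05, 54.27, 50.63, 49.86, 53.04, 51.33, 56.70, 55.07, 55.76, 54.40,
     55.39, 57.49, 57.56, 58.76, 59.32, 57.21, 68.55, 65.04, 66.98, 63.69, 58.34]
Y = [83.9, 66.4, 73.1, 66.7, 30.1, 36.2, 22.8, 66.5, 37.0, 58.0, 71.9,
     83.1, 66.2, 72.3, 65.8, 123.5, 116.8, 160.1, 158.2, 152.2, 134.8, 87.3]

n = len(X)
x_bar = sum(X) / n
y_bar = sum(Y) / n
sxx = sum((x - x_bar) ** 2 for x in X)
syy = sum((y - y_bar) ** 2 for y in Y)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(X, Y))

b = sxy / sxx            # slope
a = y_bar - b * x_bar    # intercept
residuals = [y - (a + b * x) for x, y in zip(X, Y)]

# Direct form: residual sum of squares divided by n - 2
s2_direct = sum(e * e for e in residuals) / (n - 2)

# Computing form: (n - 1)/(n - 2) * (s_y^2 - b^2 * s_x^2)
s2_x = sxx / (n - 1)
s2_y = syy / (n - 1)
s2_formula = (n - 1) / (n - 2) * (s2_y - b ** 2 * s2_x)

s_yx = math.sqrt(s2_direct)  # standard error of estimate
within_1s = sum(abs(e) <= s_yx for e in residuals)
within_2s = sum(abs(e) <= 2 * s_yx for e in residuals)
print(f"s2 = {s2_direct:.3f}, s = {s_yx:.4f}, within 1s: {within_1s}, within 2s: {within_2s}")
```

The two forms agree apart from floating-point rounding, which is the reason either computing form may be used in place of the direct sum of squared residuals.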


6.1.3.  Fraction  of  y  Variance  Accounted  for  by 
Regression — Correlation  Coefficient  r 

From Sec. 6.1.2 it follows that the fraction of the population y variance
accounted for by the regression on x is

$$ \frac{\sigma_{y'}^2}{\sigma_y^2} = 1 - \frac{\sigma_{y|x}^2}{\sigma_y^2} $$

The square root of this fraction is the population correlation coefficient
ρ; the sign is taken as positive or negative to agree with the sign of the
slope of the population regression line. (A negative correlation indicates
that high values of y are associated with low values of x.) The best estimate
of ρ from the sample is the sample correlation coefficient

$$ r = \frac{\sum (x - \bar{x})(y - \bar{y})}{(n - 1)\, s_x s_y} = \frac{n \sum xy - \sum x \sum y}{\sqrt{[n \sum x^2 - (\sum x)^2]\,[n \sum y^2 - (\sum y)^2]}} $$

If the regression coefficient has already been calculated, we can use the form

$$ r = b\,\frac{s_x}{s_y} $$

The correlation coefficient always lies between −1 and +1. If, and only if,
all points lie on the regression line, then r = ±1. If r = 0, the regression
does not explain anything about the variation of y, and the regression line
is horizontal ($y' = \bar{y}$).

The correlation coefficient is defined primarily for Type II problems,
where r is an estimate of the true correlation coefficient ρ of the joint
two-dimensional normal distribution (Fig. 6.2), irrespective of the random
set of x’s obtained. The correlation coefficient does not have such an
interpretation in Type I problems, where the regression coefficient b is
ordinarily used; however, it is formally correct and sometimes convenient
to use the notation r and some of the associated formulas.

Example. For the data of Table 6.1, we calculate

$$ r^2 = \frac{(n \sum xy - \sum x \sum y)^2}{[n \sum x^2 - (\sum x)^2]\,[n \sum y^2 - (\sum y)^2]} = \frac{(83,876.437)^2}{(11,404.3817)(768,947.21)} = 0.802254 $$

$$ r = 0.8957 $$

We see from $r^2$ that the regression of y on x accounts for about 80% of
the variance of y.
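
As an added illustration (not from the Manual), the correlation coefficient can be computed from the sums quoted for Table 6.1, both by the computing form and by the relation r = b s_x / s_y. The variable names below are assumptions for this sketch.

```python
import math

# Sums for Table 6.1 as quoted in the worked example
n = 22
sum_x, sum_y = 1260.21, 1832.9
sum_xx, sum_yy, sum_xy = 72706.0739, 187657.71, 108805.243

num = n * sum_xy - sum_x * sum_y      # n*Sxy - Sx*Sy
dxx = n * sum_xx - sum_x ** 2         # n*Sxx - (Sx)^2
dyy = n * sum_yy - sum_y ** 2         # n*Syy - (Sy)^2

# Computing form of the sample correlation coefficient
r = num / math.sqrt(dxx * dyy)

# Alternative form using the slope b and the sample standard deviations
b = num / dxx
s_x = math.sqrt(dxx / (n * (n - 1)))
s_y = math.sqrt(dyy / (n * (n - 1)))
r_alt = b * s_x / s_y

print(f"r = {r:.4f}, r^2 = {r * r:.6f}, r via b*s_x/s_y = {r_alt:.4f}")
```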

6.1.4.  Reliability  of  Regression  Measures 

The significance tests and confidence intervals given below are exact only
under the normality assumptions discussed in Sec. 6.0. With the exception
of Subsec. b, all of Sec. 6.1.4 is applicable to Type I problems, but the
test in Subsec. a is identical with a special case of Subsec. c, so that only
Subsec. c to f need be considered in Type I problems. All of this section
is applicable to Type II problems, Subsec. a again being equivalent to a
special case of Subsec. c. (The tests of hypotheses in Subsec. c and d have
somewhat different operating characteristic (OC) curves in Type I and
Type II problems, but the difference is not a practical concern. See Sec.
1.1.8-1.1.9 for OC curves.)

a. Significance of Sample Correlation Coefficient r. Appendix Table
7 gives 1 and 5% critical points for the absolute value of r, denoted by |r|.
We read the critical value under two variables and opposite n − 2 degrees
of freedom. If the computed |r| exceeds the critical value, we reject at that
level of significance the null hypothesis that the population of x’s and y’s
has zero correlation.

Example. For the data of Table 6.1, r = 0.896. Suppose we wish
to test the hypothesis that there is zero correlation between the %
nitrocellulose and Young’s modulus at the 5% level of significance. From
Appendix Table 7, under two variables and opposite 22 − 2 = 20
degrees of freedom, we read 0.423 for the 5% critical point. Since
0.896 exceeds 0.423, we reject the null hypothesis. As explained in
Sec. 1.1.7, the hypothesis is rejected because if it were true such a high
value of |r| would occur with probability less than 5%. The test is an
equal-tails test, appropriate when we are interested in either positive or
negative correlation, as is usually the case.
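
The equivalence between the test on r and the t test on the slope (Sec. 6.1.4c) can be illustrated with a short sketch. It is an addition to the text and relies on a standard identity that the Manual does not state explicitly here: under the hypothesis of zero correlation, t = r√(n − 2)/√(1 − r²) follows the t distribution with n − 2 degrees of freedom. The function names are hypothetical, and 2.086 is the 5% two-tailed t critical value for 20 degrees of freedom quoted in Sec. 6.1.4c.

```python
import math


def t_from_r(r, df):
    """t statistic equivalent to a sample correlation r with df = n - 2."""
    return r * math.sqrt(df) / math.sqrt(1.0 - r * r)


def r_critical(t_crit, df):
    """Critical |r| implied by a two-tailed t critical value with df = n - 2."""
    return t_crit / math.sqrt(t_crit ** 2 + df)


df = 22 - 2
r_crit = r_critical(2.086, df)   # compare with the Appendix Table 7 entry
t_stat = t_from_r(0.8957, df)    # compare with t = b / s_b in Sec. 6.1.4c
print(f"critical |r| = {r_crit:.3f}, t equivalent to r = {t_stat:.3f}")
```

Under these assumptions the critical |r| agrees with the tabled 0.423 to three decimals, and the t value agrees with the slope test of Sec. 6.1.4c apart from rounding.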

b. Confidence Interval for Population Correlation Coefficient ρ.

This subsection applies only to Type II problems. With the help of
Appendix Chart XI we can construct, on the basis of the correlation coefficient r
from a sample of size n, a confidence interval for the true (population)
correlation coefficient ρ. In 95% of our experiments the interval so
constructed will contain ρ.

Example. For the data of Table 6.1, r = 0.896. Construct a 95%
confidence interval for ρ.

Entering Appendix Chart XI with this abscissa, we obtain for sample
size 22 (interpolating between curves labeled 20 and 25) the 95%
confidence interval 0.75 to 0.96.
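
When the chart is not at hand, a rough numerical check is possible with Fisher's z transformation. This is a different, approximate technique swapped in for illustration, not the method behind Appendix Chart XI: z = artanh(r) is treated as approximately normal with standard deviation 1/√(n − 3). The function name and the normal multiplier 1.96 are assumptions of this sketch.

```python
import math


def fisher_z_interval(r, n, z_crit=1.96):
    """Approximate confidence interval for rho via Fisher's z transformation.

    z_crit = 1.96 corresponds to an approximate 95% two-sided interval.
    """
    z = math.atanh(r)                   # transform r to the z scale
    half = z_crit / math.sqrt(n - 3)    # approximate half-width on the z scale
    return math.tanh(z - half), math.tanh(z + half)


lower, upper = fisher_z_interval(0.896, 22)
print(f"approximate 95% interval for rho: {lower:.2f} to {upper:.2f}")
```

The approximation gives limits close to those read from the chart; small differences are expected because the chart is based on the exact distribution of r.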
c. Significance Test and Confidence Interval for the Slope b of
the Regression Line. The standard error of the regression coefficient b
can be estimated from the sample as


$$ s_b = \frac{s_{y|x}}{s_x \sqrt{n - 1}} $$

where

$$ s_x^2 = \frac{n \sum x^2 - (\sum x)^2}{n(n - 1)} $$

and

$$ s_{y|x}^2 = \frac{n - 1}{n - 2}\,(s_y^2 - b^2 s_x^2) $$

To test the null hypothesis that the slope β of the true or population
regression line has any stated value, say B, compute

$$ t = \frac{b - B}{s_b} $$

Reject the null hypothesis at the significance level α if |t| exceeds the
critical value $t_{\alpha/2,\,n-2}$ given in Appendix Table 3 for P(t) = α/2 and
f = n − 2 degrees of freedom (Ref. 6.2, pp. 402-3 and 548-51).

A 100(1 − α)% confidence interval for the true slope β is

$$ b \pm t_{\alpha/2,\,n-2}\, s_b $$

Example. For the data of Table 6.1, test the null hypothesis that
β has the value B = 0.

We compute

$$ s_b = \frac{s_{y|x}}{s_x \sqrt{n - 1}} = \frac{18.5898}{\sqrt{(24.6848)(21)}} = 0.81649 $$

and

$$ t = \frac{b}{s_b} = \frac{7.35476}{0.81649} = 9.008 $$

Since Appendix Table 3 gives $t_{.025,20} = 2.086$, we find the computed t,
and hence b, significantly different from zero at the 5% level of
significance.

To find a 95% confidence interval for the true slope β, we compute

$$ b \pm t_{.025,20}\, s_b = 7.35476 \pm (2.086)(0.81649) $$

We find a 95% confidence interval of 5.652 to 9.058.
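
The arithmetic of this example can be retraced with a few lines of Python. The sketch is an addition to the text; it takes the summary values from the example as given, uses the tabled critical value 2.086 rather than computing it, and its variable names are assumptions.

```python
import math

# Summary values from the worked example for Table 6.1
n = 22
b = 7.35476        # sample slope
s_yx = 18.5898     # standard error of estimate
s2_x = 24.6848     # sample variance of x
t_crit = 2.086     # t(.025, 20) as read from Appendix Table 3
B = 0.0            # hypothesized slope

s_b = s_yx / math.sqrt(s2_x * (n - 1))   # standard error of the slope
t_stat = (b - B) / s_b
ci_low = b - t_crit * s_b
ci_high = b + t_crit * s_b

print(f"s_b = {s_b:.5f}, t = {t_stat:.3f}")
print(f"95% interval for the slope: {ci_low:.3f} to {ci_high:.3f}")
print("reject B = 0" if abs(t_stat) > t_crit else "do not reject B = 0")
```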


d. Significance of the Difference Between the Regression Coefficients
$b_1$ and $b_2$ of Two Separate Equations. If two lines $y'_1 = a_1 + b_1 x$
and $y'_2 = a_2 + b_2 x$ have been determined from two sets of data with
comparable variables, the t test can be used to test the significance of the
difference between the slopes, under the assumption that the population
standard errors of estimate for the two relations are equal. (This
assumption can be tested as in Sec. 3.3 by forming the F ratio of the two values
of $s_{y|x}^2$, say $s_{y_1|x}^2$ and $s_{y_2|x}^2$, and comparing it with the critical value in
Appendix Table 5 for $n_1 - 2$ and $n_2 - 2$ degrees of freedom.)

We compute the best estimate $s_{y|x}^2$ of the common variance $\sigma_{y|x}^2$

$$ s_{y|x}^2 = \frac{(n_1 - 2)\, s_{y_1|x}^2 + (n_2 - 2)\, s_{y_2|x}^2}{n_1 + n_2 - 4} $$

Based on the pooled information from both samples, the estimate of the
variance of the regression coefficient $b_i$ is

$$ s_{b_i}^2 = \frac{s_{y|x}^2}{(n_i - 1)\, s_{x_i}^2} \qquad (i = 1, 2) $$

and the standard deviation of the difference $b_1 - b_2$ is estimated from

$$ s_{b_1 - b_2} = \sqrt{s_{b_1}^2 + s_{b_2}^2} = s_{y|x} \sqrt{\frac{1}{(n_1 - 1)\, s_{x_1}^2} + \frac{1}{(n_2 - 1)\, s_{x_2}^2}} $$

To test the null hypothesis that the difference $\beta_1 - \beta_2$ between the true
(population) regression coefficients has any stated value, say Δ, compute

$$ t = \frac{b_1 - b_2 - \Delta}{s_{b_1 - b_2}} $$

Reject the hypothesis at the significance level α if |t| exceeds the critical
value $t_{\alpha/2,\,n_1+n_2-4}$ given in Appendix Table 3.
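
The Manual gives no numerical example for this test, so the sketch below (an addition) uses made-up summary values purely to show the order of the computations. The function name, its argument names, and the numbers in the call are all hypothetical; `delta` stands for the stated value of the true difference between slopes.

```python
import math


def slope_difference_t(b1, n1, s2_x1, s2_yx1, b2, n2, s2_x2, s2_yx2, delta=0.0):
    """Return (t, degrees of freedom) for testing beta1 - beta2 = delta.

    s2_x*  : sample variances of x in the two data sets
    s2_yx* : squared standard errors of estimate of the two fitted lines
    """
    df = n1 + n2 - 4
    # Pooled estimate of the common variance about the regression lines
    pooled = ((n1 - 2) * s2_yx1 + (n2 - 2) * s2_yx2) / df
    # Variances of the two slopes based on the pooled estimate
    var_b1 = pooled / ((n1 - 1) * s2_x1)
    var_b2 = pooled / ((n2 - 1) * s2_x2)
    s_diff = math.sqrt(var_b1 + var_b2)
    return (b1 - b2 - delta) / s_diff, df


# Hypothetical inputs chosen so the arithmetic is easy to follow by hand
t_stat, df = slope_difference_t(b1=3.0, n1=11, s2_x1=1.0, s2_yx1=4.0,
                                b2=1.0, n2=11, s2_x2=1.0, s2_yx2=6.0)
print(f"t = {t_stat:.3f} with {df} degrees of freedom")
```

With these invented inputs the pooled variance is 5, each slope variance is 0.5, and the standard deviation of the difference is 1, so t equals the difference of the slopes.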

For the comparison of more than two regression coefficients see Ref. 6.4,
Chap. 12, or Ref. 6.7, Sec. 18.9.

e. Confidence Interval for an Ordinate to the True Regression Line.

The ordinate of the sample regression line for any given x = X (which
need not be any of the observed x’s) is calculated as

$$ y' = a + bX $$

This $y'$ differs from the true or population mean ordinate at x = X which
would be obtained if infinitely many observations could be made with the
same value X of x. We can show how good our estimate $y'$ of the true
mean ordinate is by calculating 100(1 − α)% confidence limits

$$ y' \pm t_{\alpha/2,\,n-2}\, s_{y|x} \sqrt{\frac{1}{n} + \frac{(X - \bar{x})^2}{(n - 1)\, s_x^2}} $$

where $t_{\alpha/2,\,n-2}$ is obtained from Appendix Table 3. In this way we can
construct a confidence interval for any particular ordinate of interest. Notice
that if X is set equal to $\bar{x}$ in the equation $y' = a + bX$, we have $y' = \bar{y}$.
If X = 0, then $y' = a$, the y intercept.

To make an equal-tails test of the null hypothesis that the ordinate to the
true regression line at the chosen X has any stated value Y, we compute

$$ t = \frac{y' - Y}{s_{y|x} \sqrt{\dfrac{1}{n} + \dfrac{(X - \bar{x})^2}{(n - 1)\, s_x^2}}} $$

We reject the hypothesis at the significance level α if |t| exceeds the critical
value $t_{\alpha/2,\,n-2}$ given in Appendix Table 3.


Example. Using the data of Table 6.1, calculate a 95% confidence
interval for the ordinate to the regression line of Fig. 6.3 for the
percentage x = X of nitrocellulose.

We compute

$$ y' \pm t_{\alpha/2,\,n-2}\, s_{y|x} \sqrt{\frac{1}{n} + \frac{(X - \bar{x})^2}{(n - 1)\, s_x^2}} $$

with n = 22, $\bar{x} = 57.28$, $s_{y|x} = 18.59$, $(n - 1)s_x^2 = 518.38$, and
$y' = -337.98 + 7.3548X$. From Appendix Table 3, we find $t_{.025,20} = 2.086$.
Hence the 95% confidence limits for the ordinate are

$$ -337.98 + 7.3548X \pm (2.086)(18.59) \sqrt{\frac{1}{22} + \frac{(X - 57.28)^2}{518.38}} $$

$$ = -337.98 + 7.3548X \pm \sqrt{68.36 + 2.901\,(X - 57.28)^2} $$

These limits are the curved lines plotted in Fig. 6.3. Note that the error
in estimating the true mean ordinate is smallest at X = $\bar{x}$, and increases
steadily as $|X - \bar{x}|$ increases.
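
To see how the width of these limits changes with X, the confidence limits can be evaluated at a few abscissas. The sketch below is an addition to the text; it uses the summary values of the example, the tabled value 2.086, and a hypothetical function name.

```python
import math

# Summary values for Table 6.1 taken from the worked examples
N_OBS = 22
X_BAR = 1260.21 / 22          # mean of x, about 57.28
S_YX = 18.5898                # standard error of estimate
SXX = 21 * 24.6848            # (n - 1) * s_x^2, about 518.38
A, B = -337.983, 7.35476      # intercept and slope
T_CRIT = 2.086                # t(.025, 20) from Appendix Table 3


def mean_ordinate_interval(x_new):
    """95% confidence limits for the true mean ordinate at x = x_new."""
    y_hat = A + B * x_new
    half = T_CRIT * S_YX * math.sqrt(1.0 / N_OBS + (x_new - X_BAR) ** 2 / SXX)
    return y_hat - half, y_hat + half


for x_new in (50.0, X_BAR, 60.0, 68.0):
    lo, hi = mean_ordinate_interval(x_new)
    print(f"X = {x_new:6.2f}: {lo:7.2f} to {hi:7.2f} (half-width {(hi - lo) / 2:.2f})")
```

The half-width is smallest at the mean of x and grows on either side, which is the behavior of the curved limits described above.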

f. Prediction Interval for Individual Value of y. For any given x
the individual values of y are scattered above and below both the true and
the sample regression lines. In practice, it may more often be of interest to
know how closely one can predict an individual value of y rather than the
mean value given by the regression line (Sec. 6.1.4e). The individual value
of y corresponding to a given x = X is, as in Sec. 6.1.4e, calculated as

$$ y' = a + bX $$

The formula for a 100(1 − α)% prediction interval for y (explained
below) is

$$ y' \pm t_{\alpha/2,\,n-2}\, s_{y|x} \sqrt{1 + \frac{1}{n} + \frac{(X - \bar{x})^2}{(n - 1)\, s_x^2}} $$

To see what such an interval means, suppose that we conduct an experiment
of n observations and construct an interval for the value of y corresponding
to some given x = X; and suppose that we take a single further observation
$y_{n+1}$ at x = X. The length of the prediction interval is determined so that
in many repetitions of this whole procedure, on the average 100(1 − α)%
of the intervals will contain the corresponding additional observation $y_{n+1}$.
(See Ref. 5.12.)

Example. For the data of the example in Sec. 6.1.4e, the 95%
prediction interval for a single observation y at x = X = 60 is

$$ -337.98 + 7.3548X \pm 38.78 \sqrt{1 + \frac{1}{22} + \frac{(X - 57.28)^2}{518.38}} $$

$$ = -337.98 + 7.3548X \pm \sqrt{1572.2 + 2.901\,(X - 57.28)^2} $$

$$ = 103.31 \pm \sqrt{1593.7} = 103.31 \pm 39.92 $$

$$ = 63.39 \text{ and } 143.23 $$

Evidently a single value of y (Young’s modulus) can be predicted but
poorly; this is because of intrinsic variability from sample to sample
rather than lack of knowledge of the true regression line, as shown by
the relative sizes of the square-root quantities here and in Sec. 6.1.4e.
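
The comparison between the two square-root quantities can be made concrete with a short sketch. It is an addition to the text, uses the same summary values as the example, and the function name is hypothetical.

```python
import math

# Summary values for Table 6.1 taken from the worked examples
N_OBS = 22
X_BAR = 1260.21 / 22          # mean of x
S_YX = 18.5898                # standard error of estimate
SXX = 21 * 24.6848            # (n - 1) * s_x^2
A, B = -337.983, 7.35476      # intercept and slope
T_CRIT = 2.086                # t(.025, 20) from Appendix Table 3


def prediction_interval(x_new):
    """95% prediction limits for one further observation of y at x = x_new."""
    y_hat = A + B * x_new
    spread = 1.0 + 1.0 / N_OBS + (x_new - X_BAR) ** 2 / SXX
    half = T_CRIT * S_YX * math.sqrt(spread)
    return y_hat - half, y_hat + half


lo, hi = prediction_interval(60.0)
# Half-width of the interval for the mean ordinate at the same X, for comparison
mean_half = T_CRIT * S_YX * math.sqrt(1.0 / N_OBS + (60.0 - X_BAR) ** 2 / SXX)
print(f"prediction limits at X = 60: {lo:.2f} to {hi:.2f}")
print(f"half-widths: individual {(hi - lo) / 2:.2f}, mean ordinate {mean_half:.2f}")
```

The leading 1 under the square root, which represents the scatter of a single observation, dominates the other two terms; this is why the prediction interval is so much wider than the interval for the mean ordinate.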


6.1.5.  Summary  of  Computations  for  Two-Variable 
Linear  Regression 

We  show  below  the  calculations  for  the  data  of  Table  6.1.  Accumulated 
multiplication  on  a  desk  computer  should  be  used  to  find  sums  and  sums  of 
squares  simultaneously. 

1. Sums and variances

$$ \sum x = 1,260.21 \qquad \sum x^2 = 72,706.0739 $$
$$ \sum y = 1,832.9 \qquad \sum y^2 = 187,657.71 $$
$$ n = 22 \qquad \sum xy = 108,805.243 $$

$$ s_x^2 = \frac{n \sum x^2 - (\sum x)^2}{n(n - 1)} = \frac{11,404.3817}{(22)(21)} = 24.6848 $$

$$ s_y^2 = \frac{n \sum y^2 - (\sum y)^2}{n(n - 1)} = \frac{768,947.21}{(22)(21)} = 1,664.3879 $$

2. Regression line

$$ y' = a + bx $$

$$ b = \frac{n \sum xy - \sum x \sum y}{n \sum x^2 - (\sum x)^2} = \frac{83,876.437}{11,404.3817} = 7.35476 $$

$$ a = \frac{\sum y - b \sum x}{n} = \frac{1,832.9 - (7.35476)(1,260.21)}{22} = -337.98 $$

3. Correlation coefficient

$$ r^2 = \frac{(n \sum xy - \sum x \sum y)^2}{[n \sum x^2 - (\sum x)^2]\,[n \sum y^2 - (\sum y)^2]} = \frac{(83,876.437)^2}{(11,404.3817)(768,947.21)} = 0.802254 $$

$$ r = \sqrt{0.802254} = 0.8957 $$

4. Standard error of estimate

$$ s_{y|x}^2 = \frac{n - 1}{n - 2}\,(s_y^2 - b^2 s_x^2) = 1.05\,(329.126) = 345.582 $$

$$ s_{y|x} = \sqrt{345.582} = 18.5898 $$

5. Standard error of regression coefficient b

$$ s_b = \frac{s_{y|x}}{s_x \sqrt{n - 1}} = \frac{18.5898}{\sqrt{(24.6848)(21)}} = 0.81649 $$

6. Significance of r. Compare |r| with the critical value in Appendix
Table 7 for 2 variables and n − 2 degrees of freedom.

7. Significance of b. Compare $|t| = |b|/s_b$ with the critical value in
Appendix Table 3 for n − 2 degrees of freedom.

For  further  information  on  the  calculations  given  above,  see  Sec.  6.1.4. 


6.1.6.  Test  for  Linearity  of  Regression 

If the regression function of y on x is known to be a polynomial, the tests
given in Sec. 6.3.1 and 6.3.2 can be used to determine the degree of that
polynomial. In the latter case, the x values used must be equally spaced.
If the degree of the polynomial is one, the relation is linear.

A cruder test of linearity can be made by considering the sequence of
signs of the deviations $y_i - y'_i$ from the fitted regression line in order of
increasing $x_i$, and applying the run test for randomness described in Sec.
4.1. For example, in Fig. 6.3 the sequence of signs of $y_i - y'_i$ is

+ + − − + + − − + − + − + − − − − + + + − −

with 10 +’s, 12 −’s, and 12 runs. If the population regression curve is
non-linear, the number of runs usually tends to be less than if it were linear.
For a test of linearity at the 5% significance level we compare 12 with the
critical value $v_{.05} = 7$ in Appendix Table 10 for $n_1 = 10$ and $n_2 = 12$. Thus
we do not reject the hypothesis of linearity. The same test can be applied to
any fitted curve, not merely a straight line.
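
The sign sequence and the run count can be obtained directly from the data of Table 6.1. The following sketch is an addition to the text; the helper name `fit_line` and the convention of counting a zero deviation as a minus sign (none occurs here) are assumptions of the illustration.

```python
# Table 6.1: x = % nitrocellulose, y = Young's modulus (thousands of psi)
X = [55.77, 55.05, 54.27, 50.63, 49.86, 53.04, 51.33, 56.70, 55.07, 55.76, 54.40,
     55.39, 57.49, 57.56, 58.76, 59.32, 57.21, 68.55, 65.04, 66.98, 63.69, 58.34]
Y = [83.9, 66.4, 73.1, 66.7, 30.1, 36.2, 22.8, 66.5, 37.0, 58.0, 71.9,
     83.1, 66.2, 72.3, 65.8, 123.5, 116.8, 160.1, 158.2, 152.2, 134.8, 87.3]


def fit_line(xs, ys):
    """Return (a, b) of the least-squares line y' = a + b*x (hypothetical helper)."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xx = sum(x * x for x in xs)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_xx - sum_x ** 2)
    return (sum_y - b * sum_x) / n, b


a, b = fit_line(X, Y)

# Signs of the deviations y - y', taken in order of increasing x
ordered = sorted(zip(X, Y))
signs = "".join("+" if y - (a + b * x) > 0 else "-" for x, y in ordered)

n_plus = signs.count("+")
n_minus = signs.count("-")
# A new run starts wherever the sign differs from the preceding sign
runs = 1 + sum(1 for s, t in zip(signs, signs[1:]) if s != t)

print(signs)
print(f"{n_plus} plus signs, {n_minus} minus signs, {runs} runs")
```

The run count is then compared with the critical value from Appendix Table 10, as described above; the table lookup itself is not reproduced in the sketch.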

A much better test of linearity, or of any form of curve, is available in a
Type I model if more than one observation of y is made for each selected
value of x. An F test can be made of the variance about the regression
relative to the variance within groups, similar to the methods of Chapter 5.
Let the $n_i$ values of y for each chosen x value $x_i$ be called $y_{ij}$, where
$j = 1, 2, \ldots, n_i$ and $i = 1, 2, \ldots, m > 2$.

To test for a general linear relation between x and y

$$ y = \alpha + \beta x $$

where the values of α and β are not specified, calculate the m means

$$ \bar{y}_i = \frac{1}{n_i} \sum_{j=1}^{n_i} y_{ij} \qquad (i = 1, 2, \ldots, m) $$

and the m estimates of variance

$$ s_i^2 = \frac{\sum_{j=1}^{n_i} (y_{ij} - \bar{y}_i)^2}{n_i - 1} \qquad (i = 1, 2, \ldots, m) $$

These estimate the variances of the $y_{ij}$’s for each $x_i$. In the regression
model these variances are all assumed equal (to $\sigma_{y|x}^2$). This assumption can
be tested as in Sec. 3.4. The m estimates of variance $s_i^2$ can be combined
to give an estimate of the common variance $\sigma_{y|x}^2$

$$ s_1^2 = \frac{\sum (n_i - 1)\, s_i^2}{N - m} $$

with N − m degrees of freedom, where $N = \sum n_i$. Here, and throughout
the following summations, the index of summation is i and the range (from
1 to m) will be omitted for the sake of simplicity.

The constants α and β are estimated by the least-squares line

$$ y' = a + bx $$

fitted to the data, where

$$ b = \frac{N \sum n_i x_i \bar{y}_i - (\sum n_i x_i)(\sum n_i \bar{y}_i)}{N \sum n_i x_i^2 - (\sum n_i x_i)^2} $$

and

$$ a = \frac{\sum n_i \bar{y}_i - b \sum n_i x_i}{N} $$

(These follow directly from Sec. 6.1.1 by denoting the equal values of x
by the same symbol.) Using the values $y'_i$ given by this least-squares line,
we form

$$ s_2^2 = \frac{\sum n_i (\bar{y}_i - y'_i)^2}{m - 2} = \frac{N \sum n_i \bar{y}_i^2 - (\sum n_i \bar{y}_i)^2 - b\,[N \sum n_i x_i \bar{y}_i - (\sum n_i x_i)(\sum n_i \bar{y}_i)]}{N(m - 2)} $$

with m − 2 degrees of freedom. The latter form is preferred for computing.
If the hypothesis of linearity is true, $s_2^2$ also estimates $\sigma_{y|x}^2$; if not, $s_2^2$ tends
to exceed $\sigma_{y|x}^2$. To test the hypothesis of linearity, compute the F ratio

$$ F = \frac{s_2^2}{s_1^2} $$

Reject the hypothesis at the 5% significance level if the calculated F is larger
than the critical value $F_{.05}(m - 2, N - m)$ in Appendix Table 5.
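
No numerical example accompanies this test in the text, so the sketch below (an addition) applies the steps to a small invented data set with two observations of y at each of three x values. The function name `linearity_f` and the data are hypothetical; the critical value must still be read from an F table.

```python
def linearity_f(groups):
    """F ratio for linearity from replicated data.

    groups: list of (x_i, [y_i1, ..., y_in_i]) with at least one replicated x.
    Returns (F, numerator df, denominator df).
    """
    m = len(groups)
    sizes = [len(ys) for _, ys in groups]
    total = sum(sizes)                              # N
    xs = [x for x, _ in groups]
    means = [sum(ys) / len(ys) for _, ys in groups]

    # Pooled within-group variance, N - m degrees of freedom
    within_ss = sum((y - mean) ** 2
                    for (_, ys), mean in zip(groups, means) for y in ys)
    s1_sq = within_ss / (total - m)

    # Least-squares line through the group means, weighted by group size
    sw_x = sum(n * x for n, x in zip(sizes, xs))
    sw_y = sum(n * yb for n, yb in zip(sizes, means))
    sw_xx = sum(n * x * x for n, x in zip(sizes, xs))
    sw_xy = sum(n * x * yb for n, x, yb in zip(sizes, xs, means))
    b = (total * sw_xy - sw_x * sw_y) / (total * sw_xx - sw_x ** 2)
    a = (sw_y - b * sw_x) / total

    # Variance of the group means about the line, m - 2 degrees of freedom
    lack_ss = sum(n * (yb - (a + b * x)) ** 2
                  for n, x, yb in zip(sizes, xs, means))
    s2_sq = lack_ss / (m - 2)
    return s2_sq / s1_sq, m - 2, total - m


# Invented data: group means 2, 4, 3 do not fall on a straight line
f_ratio, df_num, df_den = linearity_f([(1, [1, 3]), (2, [3, 5]), (3, [2, 4])])
print(f"F = {f_ratio:.2f} with ({df_num}, {df_den}) degrees of freedom")
```

When the group means lie exactly on a line, the numerator variance is zero and F is zero; departures of the means from the line inflate the numerator relative to the within-group variance.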

If the hypothesis of linearity is not rejected, the significance tests and
confidence intervals for b and any particular $y'$ (such as a) can be made as
in Sec. 6.1.4 on ungrouped data, except that $s_{y|x}^2$ is replaced by $s_1^2$ and the
number of degrees of freedom n − 2 is replaced by N − m.

If the hypothesis of linearity is rejected, then some other less simple form
of relation must be considered. The data may be used to discover such a
relation, but rigorous statistical tests of the validity of the relation suggested
by the data cannot be made with the same data. An entirely new set must
be gathered.

See Ref. 6.7 (Sec. 18.3, 18.4, and 18.7) for a more extensive discussion
of linearity tests.

The test has much more general application than that described above.
It can be used with only slight changes to test any form of relation in which
$y'$ is linear in the constants to be determined; e.g., for testing a polynomial
in x. The only changes are the following: (1) the equation of interest, rather
than a straight line, is fitted by least squares as in Sec. 6.2 or 6.3; and (2) if
the number of constants to be determined is k + 1 rather than 2 (where
k + 1 < m), then m − k − 1 replaces m − 2 as the number of degrees of
freedom in the denominator of the first expression for $s_2^2$, and the second
expression for $s_2^2$ does not apply.

6.2. Multiple Linear Regression

If the variable y depends linearly on several factors, it is best to use a
formula involving all of them for predicting y. Denote the k such factors, or
variables, by $x_1, x_2, \ldots, x_k$. A sample of n observations would consist of
n sets of values: $x_{11}, x_{21}, \ldots, x_{k1}, y_1$; $x_{12}, x_{22}, \ldots, x_{k2}, y_2$; ...; and
$x_{1n}, x_{2n}, \ldots, x_{kn}, y_n$. The subscript denoting the particular observation
is usually omitted, and the representative factor is denoted by $x_j$ where
$j = 1, 2, \ldots, k$.

The general discussion of the meaning of the regression equation in
Sec. 6.0 and 6.1 applies to multiple as well as simple regression. If the
whole population of values of $(x_1, x_2, \ldots, x_k, y)$ were known, the ordinate
to the true or population regression plane at the x point $(x_1, x_2, \ldots, x_k)$
would be the y mean for that x point (Fig. 6.4). The values of
$x_1, x_2, \ldots, x_k$ may be specified by the experimenter or they may occur
randomly, giving rise to the Type I and Type II problems mentioned in
Sec. 6.0. Intermediate mixed types may occur. The restrictions on Type I
and Type II models in the multiple-regression case are analogous to those
in the simple-regression case. In any case, each sample value of y is assumed
to be a random observation from a normal population. These populations
have a common standard deviation, but different means, depending linearly
on $x_1, x_2, \ldots, x_k$.

FIG. 6.4. Regression Plane.

The methods of this section presuppose a linear relationship between the
mean value of $y$ and the other variables. A rough check on this assumption
can be made by graphing $y$ against the $x$'s taken one at a time (see Ref. 6.5,
Chap. 11, 14, and 16). A general test of a regression surface, which requires
several observations of $y$ for each point $(x_1, x_2, \ldots, x_k)$, is available in
Sec. 6.1.6.

If the regression curves seem to have a form other than linear, a
transformation of variables may yield a linear relationship which would make
the methods of this section applicable. However, if the forms of the
curves are discovered from the data under analysis, they represent a
hypothesis rather than an established fact and should be tested with a
further experiment.

6.2.1.  Multiple  Linear  Equation 

The predicted value $y'$ is expressed by the sample regression equation

$$y' = a + b_1 x_1 + b_2 x_2 + \cdots + b_k x_k$$

where the $x_j$ are the variables affecting $y$, the partial regression
coefficients $b_j$ are determined as shown in Sec. 6.2.2, and

$$a = \bar{y} - b_1 \bar{x}_1 - b_2 \bar{x}_2 - \cdots - b_k \bar{x}_k$$

the sample averages being $\bar{y}, \bar{x}_1, \bar{x}_2, \ldots, \bar{x}_k$. Thus the regression equation
could also be written

$$y' - \bar{y} = b_1 (x_1 - \bar{x}_1) + b_2 (x_2 - \bar{x}_2) + \cdots + b_k (x_k - \bar{x}_k)$$

This equation represents the best-fitting plane for the sample data plotted
in $k + 1$ dimensions, in the sense of the least sum of squares of $y$ deviations.

6.2.2. Determination of Partial Regression Coefficients

We determine the partial regression coefficients $b_j$ in the regression
equation by solving the normal equations

$$\begin{aligned}
b_1 \sum (x_1 - \bar{x}_1)^2 + b_2 \sum (x_1 - \bar{x}_1)(x_2 - \bar{x}_2) + \cdots + b_k \sum (x_1 - \bar{x}_1)(x_k - \bar{x}_k) &= \sum (x_1 - \bar{x}_1)(y - \bar{y}) \\
b_1 \sum (x_2 - \bar{x}_2)(x_1 - \bar{x}_1) + b_2 \sum (x_2 - \bar{x}_2)^2 + \cdots + b_k \sum (x_2 - \bar{x}_2)(x_k - \bar{x}_k) &= \sum (x_2 - \bar{x}_2)(y - \bar{y}) \\
&\;\;\vdots \\
b_1 \sum (x_k - \bar{x}_k)(x_1 - \bar{x}_1) + b_2 \sum (x_k - \bar{x}_k)(x_2 - \bar{x}_2) + \cdots + b_k \sum (x_k - \bar{x}_k)^2 &= \sum (x_k - \bar{x}_k)(y - \bar{y})
\end{aligned}$$

Each sum has $n$ terms for the $n$ sample members. Thus

$$\sum (x_1 - \bar{x}_1)^2 = \sum_{i=1}^{n} (x_{1i} - \bar{x}_1)^2$$

but the second subscript has been omitted from these equations for
convenience.

To simplify the notation, we let

$$a_{hj} = n \sum (x_h - \bar{x}_h)(x_j - \bar{x}_j) = n \sum x_h x_j - \sum x_h \sum x_j$$

Note that

$$a_{hj} = a_{jh} \quad (\text{e.g., } a_{23} = a_{32} = n \sum x_2 x_3 - \sum x_2 \sum x_3)$$

and

$$a_{jj} = n \sum x_j^2 - \left(\sum x_j\right)^2 \quad (\text{e.g., } a_{11} = n \sum x_1^2 - \left(\sum x_1\right)^2)$$

Also let

$$a_{jy} = n \sum x_j y - \sum x_j \sum y \quad (\text{e.g., } a_{2y} = n \sum x_2 y - \sum x_2 \sum y)$$

and

$$a_{yy} = n \sum y^2 - \left(\sum y\right)^2$$

With this notation the correlation coefficient $r_{hj}$ between $x_h$ and $x_j$ can
be written

$$r_{hj} = \frac{a_{hj}}{\sqrt{a_{hh} a_{jj}}}$$

For ease in computing, we consider the normal equations multiplied by $n$
and written in the form

$$\begin{aligned}
a_{11} b_1 + a_{12} b_2 + \cdots + a_{1k} b_k &= a_{1y} \\
a_{21} b_1 + a_{22} b_2 + \cdots + a_{2k} b_k &= a_{2y} \\
&\;\;\vdots \\
a_{k1} b_1 + a_{k2} b_2 + \cdots + a_{kk} b_k &= a_{ky}
\end{aligned}$$

References 6.7 (pp. 642-49) and 6.9 give methods for solving
simultaneous linear equations. Any familiar method may be used, but a
large-scale problem (more than three independent variables, or many repetitions
with three or fewer) may be solved most readily by high-speed machine
methods. The technique most readily adapted to machine methods and to
assessing the precision of the results is that of inverting the matrix of
coefficients $a_{hj}$. This involves solving for the $b$'s by replacing the column
of constants $a_{jy}$ successively with $1, 0, 0, \ldots, 0$, getting values $e_{11},
e_{12}, \ldots, e_{1k}$; with $0, 1, 0, \ldots, 0$, getting $e_{21}, e_{22}, \ldots, e_{2k}$; $\ldots$;
and with $0, 0, \ldots, 0, 1$, getting $e_{k1}, e_{k2}, \ldots, e_{kk}$. For example, $e_{11},
e_{12}, \ldots, e_{1k}$ satisfy the equations

$$\begin{aligned}
a_{11} e_{11} + a_{12} e_{12} + \cdots + a_{1k} e_{1k} &= 1 \\
a_{21} e_{11} + a_{22} e_{12} + \cdots + a_{2k} e_{1k} &= 0 \\
a_{31} e_{11} + a_{32} e_{12} + \cdots + a_{3k} e_{1k} &= 0 \\
&\;\;\vdots \\
a_{k1} e_{11} + a_{k2} e_{12} + \cdots + a_{kk} e_{1k} &= 0
\end{aligned}$$

Then the solution of the original equations is

$$b_j = \sum_{h=1}^{k} a_{hy} e_{hj} \qquad (j = 1, 2, \ldots, k)$$

Since the $a_{hj}$ are the same in all cases, an efficient computational form, such
as the one shown in Ref. 6.7 (pp. 648-49), enables us to obtain all the $e_{hj}$
and the $b_j$ simultaneously, and provides easy and valuable step-by-step
checks.

Example. Rocket propellant grains are produced by extrusion through
a die under pressure. The major inside diameter $y$ depends not only on
the die shape, but also on the kind of powder, its initial temperature $x_1$,
the die temperature $x_2$, and the extrusion rate $x_3$. Experiments have
indicated that the average influence of $x_1$, $x_2$, and $x_3$ on $y$ can be
adequately represented by a linear equation

$$y' = a + b_1 x_1 + b_2 x_2 + b_3 x_3$$

Table 6.2 lists 19 observations taken on a particular type of grain
for the purpose of evaluating $a$, $b_1$, $b_2$, and $b_3$. The values of the $x_j$,
except for small variations, are 110 and 130°F for $x_1$; 120, 140, and
158°F for $x_2$; and 10, 20, and 28 in/min for $x_3$. Thus, the problem is
better classified as Type I than as Type II.


TABLE 6.2. Data for Multiple-Regression Example

Powder temp.   Die temp.    Extrusion rate,   [ID (in.) - 1.1] x 10^3   Check sum
 - 100, °F     - 100, °F        in/min
    x_1           x_2            x_3                    y               y + Σ x_j

     11            58             11                   126                 206
     32            20             13                    92                 157
     14            22             28                   108                 172
     26            55             28                   119                 228
      9            41             21                   103                 174
     30            18             20                    83                 151
     12            56             20                   113                 201
     29            40             26                   109                 204
      7            38              9                   130                 184
     28            57             10                   106                 201
     10            19             19                   104                 152
     31            37             18                    92                 178
     12            21             10                    94                 137
     33            40             11                    95                 179
      9            42             27                   109                 187
     12            57             29                   103                 201
     10            21             12                    82                 125
     33            40             19                    85                 177
     30            58             29                   104                 221

The computation is simplified by subtracting 100°F from the powder
temperature and the die temperature, and by subtracting 1.100 inches
from the inside diameter and expressing this last difference in
thousandths of an inch. The column of sums $y + \sum_j x_j$ is included to provide
an over-all check of the calculation of the normal equations, as shown
in Table 6.3. The check is given by the identity

$$n \sum_i \Bigl(y_i + \sum_j x_{ji}\Bigr)^2 - \Bigl[\sum_i \Bigl(y_i + \sum_j x_{ji}\Bigr)\Bigr]^2 = a_{yy} + \sum_j a_{jj} + 2 \sum_j a_{jy} + 2 \sum_{h<j} a_{hj}$$

In this example both sides of the identity are found by separate
calculation to be 261,348.

TABLE 6.3. Computation of Sums of Squares and Products

n = 19.

Quantity           x_1        x_2        x_3          y      y + Σ x_j

Σ x_j              378        740        360      1,957        3,435
Σ x_j^2          9,424     32,896      7,778    204,845      634,767
a_jj            36,172     77,424     18,182     62,206      261,348

Σ x_1 x_j                  14,860      7,222     37,865
a_1j                        2,620      1,138    -20,311

Σ x_2 x_j                             14,505     78,213
a_2j                                   9,195     37,867

Σ x_3 y                                          37,247
a_3y                                              3,173

Check sum                                                    261,348

Mean             19.89      38.95      18.95     103.00

In the product rows, the entry under y is Σ x_h y or a_hy.

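From Table 6.3, the normal equations for this example are

$$\begin{aligned}
36{,}172\, b_1 + 2{,}620\, b_2 + 1{,}138\, b_3 &= -20{,}311 \\
2{,}620\, b_1 + 77{,}424\, b_2 + 9{,}195\, b_3 &= 37{,}867 \\
1{,}138\, b_1 + 9{,}195\, b_2 + 18{,}182\, b_3 &= 3{,}173
\end{aligned}$$

By first (or simultaneously) solving the like equations in the $e_{hj}$'s by an
appropriate method, we obtain the results

$$\begin{aligned}
e_{11} &= 27.7444 \times 10^{-6} & e_{12} &= -0.7794 \times 10^{-6} & e_{13} &= -1.3423 \times 10^{-6} \\
e_{21} &= -0.7794 \times 10^{-6} & e_{22} &= 13.7631 \times 10^{-6} & e_{23} &= -6.9115 \times 10^{-6} \\
e_{31} &= -1.3423 \times 10^{-6} & e_{32} &= -6.9115 \times 10^{-6} & e_{33} &= 58.5787 \times 10^{-6} \\
b_1 &= -0.597290 & b_2 &= 0.515068 & b_3 &= -0.048583
\end{aligned}$$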

We can then write the regression equation in the form of the last
equation of Sec. 6.2.1, using the mean values at the bottom of Table 6.3

$$y' - 103.00 = -0.5973 (x_1 - 19.89) + 0.5151 (x_2 - 38.95) - 0.0486 (x_3 - 18.95)$$

Multiplying and collecting terms, we can write

$$y' = 95.74 - 0.5973 x_1 + 0.5151 x_2 - 0.0486 x_3$$

The value of the original dependent variable, the major inside diameter
in inches, is predicted for given values of $x_1$, $x_2$, and $x_3$ by dividing the
$y'$ calculated from the above equation by 1,000 and adding 1.1. If the
equation were to be used frequently, a nomogram could be constructed
to give the results in chart form (Ref. 6.1). If $x_1 = 11$, $x_2 = 58$, and
$x_3 = 11$, then $y' = 118.51$ and the predicted major inside diameter is
1.2185 inches. This predicted value may be compared with the first
observed value, 1.226 inches (coded as 126) in Table 6.2. The regression
equation gives a prediction of the average value of $y$ for the given values
of $x_1$, $x_2$, and $x_3$, as shown in Fig. 6.1 and 6.4. Individual observations,
future as well as past if background conditions remain unchanged, will
tend to cluster about the sample regression plane if sufficient data are
used to make it a good approximation to the population regression plane.
The usefulness of the equation is examined in the following sections.
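As a small illustration of how the fitted equation is used, the sketch below evaluates it at the first observation and converts the coded result back to inches. The function names are hypothetical; the coefficients and the coding (thousandths of an inch above 1.1 inches) are those of the example.

```python
INTERCEPT = 95.74
B = (-0.5973, 0.5151, -0.0486)


def predict_coded(x1, x2, x3):
    """Predicted coded diameter y' from the coded temperatures and the rate."""
    return INTERCEPT + B[0] * x1 + B[1] * x2 + B[2] * x3


def coded_to_inches(y_coded):
    """Undo the coding of Table 6.2."""
    return y_coded / 1000 + 1.1


y_coded = predict_coded(11, 58, 11)
diameter = coded_to_inches(y_coded)
print(round(y_coded, 2), round(diameter, 4))
```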


6.2.3.  Variation  About  the  Regression  Plane 

The concepts associated with multiple regression analysis are to a great
extent similar to those associated with a simple regression line. In
particular, the scatter in the vertical ($y$) direction of the observed points
about the regression plane is measured by the standard error of
estimate $s_{y|12\cdots k}$, where

$$s_{y|12\cdots k}^2 = \frac{\sum (y_i - y_i')^2}{n - k - 1}$$

It can be calculated as

$$s_{y|12\cdots k}^2 = \frac{1}{n(n - k - 1)} \left(a_{yy} - b_1 a_{1y} - b_2 a_{2y} - \cdots - b_k a_{ky}\right)$$

or, as will be seen in Sec. 6.2.4, as

$$s_{y|12\cdots k}^2 = \frac{n - 1}{n - k - 1}\, s_y^2 \left(1 - r_{y|12\cdots k}^2\right)$$

where $r_{y|12\cdots k}$ is the multiple correlation coefficient.

If $s_{y|12\cdots k}$ is based on a sufficiently large sample, it is a good estimate of
the scatter of the population about the true or population regression plane.
If the deviations from the plane are normally distributed, about 95% of
the points in a large sample will lie within $\pm 2 s_{y|12\cdots k}$ of the plane
(measured in the $y$ direction).

Example. For the example of Table 6.2, we calculate

$$\begin{aligned}
s_{y|123}^2 &= \frac{1}{(19)(15)} \bigl[62{,}206 - (-0.597290)(-20{,}311) - (0.515068)(37{,}867) - (-0.048583)(3{,}173)\bigr] \\
&= 107.81 \\
s_{y|123} &= 10.383 \times 10^{-3} \text{ inch}
\end{aligned}$$

whereas

$$\begin{aligned}
s_y^2 &= \frac{a_{yy}}{n(n - 1)} = \frac{62{,}206}{(19)(18)} = 181.89 \\
s_y &= 13.487 \times 10^{-3} \text{ inch}
\end{aligned}$$

If the deviations are approximately normally distributed, about 68%,
or 13, of the 19 observed inside diameters should lie within 0.0104 inch
of the values predicted from the corresponding $x_j$ values by the regression
equation; actually 12 do so. Likewise about 95%, or 18, should lie
within 0.0208 inch, and 19 do so.
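The arithmetic of this example can be checked with a short sketch, shown here as an illustration rather than as part of the Manual. It takes $a_{yy}$, the $a_{jy}$, and the $b_j$ from Sec. 6.2.2; the variable names are assumptions made for the illustration.

```python
from math import sqrt

n, k = 19, 3
a_yy = 62206
a_y = [-20311, 37867, 3173]
b = [-0.597290, 0.515068, -0.048583]

# Portion of a_yy accounted for by the regression: b1*a1y + b2*a2y + b3*a3y.
explained = sum(bj * ajy for bj, ajy in zip(b, a_y))

# Standard error of estimate (coded units of 0.001 inch).
s2_est = (a_yy - explained) / (n * (n - k - 1))
s_est = sqrt(s2_est)

# Ordinary sample variance of y, for comparison.
s2_y = a_yy / (n * (n - 1))
s_y = sqrt(s2_y)

print(round(s2_est, 2), round(s_est, 3), round(s2_y, 2), round(s_y, 3))
```

The drop from $s_y$ to $s_{y|123}$ is the reduction in scatter obtained by using the regression instead of the plain mean of $y$.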


6.2.4.  Relative  Variation  (Type  II  Problem) 

a. Multiple Correlation Coefficient. Just as in the case of one
independent variable, the population multiple correlation coefficient
squared, $\rho_{y|12\cdots k}^2$, is defined as the fraction of the total variance of $y$ that is
accounted for by its regression on the variables $x_1, x_2, \ldots$, and $x_k$. The
best estimate of $\rho_{y|12\cdots k}^2$ from the sample is the sample multiple
correlation coefficient squared

$$r_{y|12\cdots k}^2 = \frac{b_1 a_{1y} + b_2 a_{2y} + \cdots + b_k a_{ky}}{a_{yy}}$$

The multiple correlation coefficient itself is always taken as the positive
square root of $r_{y|12\cdots k}^2$, and it is never greater than one: $0 \le r_{y|12\cdots k} \le 1$.
A value of zero indicates no correlation between $y$ and the $x_j$; a value of one
means that all sample points lie precisely on the regression plane. Since,
because of random fluctuations, $r_{y|12\cdots k}$ is almost never zero even when
there is no correlation between $y$ and the $x_j$ in the population, it is
important to test its statistical significance; this is easily done as shown
in Sec. 6.2.5b.

As in the case of simple correlation in Sec. 6.1.3, the multiple correlation
coefficient is appropriate only in Type II problems (in which the variables
$x_j$ vary randomly rather than as chosen by the experimenter). In both
Type I and Type II problems, the size of $s_{y|12\cdots k}$ is used to assess the
goodness of fit (Sec. 6.2.3).

No numerical example will be given here since the numerical data in
Table 6.2 are for a Type I problem.

b. Partial Correlation Coefficients. This subsection, like Subsec. a, is
appropriate only for Type II problems. The sample partial correlation
coefficient $r_{y3|124\cdots k}$, for example, estimates the correlation between $y$
and $x_3$ after the influences of $x_1, x_2, x_4, \ldots$, and $x_k$, as estimated from
the sample, have been removed. The partial correlation coefficient squared,
$r_{y3|124\cdots k}^2$, is approximately the fraction of the otherwise unexplained part
of the variance of $y$ that will be accounted for by considering the remaining
variable $x_3$. The meaning of the partial correlation coefficient should be
sharply distinguished from that of the simple correlation coefficient $r_{y3}$; the
former systematically eliminates, but the latter merely ignores, the variation
caused by $x_1, x_2, x_4, \ldots$, and $x_k$.

For a three-variable regression ($k = 2$) we can calculate the partial
correlation coefficients from the simple correlation coefficients $r_{hj}$ by the
formula

$$r_{y2|1} = \frac{r_{y2} - r_{y1} r_{21}}{\sqrt{(1 - r_{y1}^2)(1 - r_{21}^2)}}$$

merely interchanging the 1 and the 2 to obtain $r_{y1|2}$. (See Sec. 6.2.2 for
calculation of the $r_{hj}$.) This formula shows that $r_{y2}$ may be positive while
$r_{y2|1}$ is zero or negative, so that little or nothing about basic relations can
be concluded directly from the simple correlation coefficient if there is some
further variable influencing $y$. For example, if $r_{y2} = 0.5$, and $r_{y1} = r_{21} = 0.8$,
then $r_{y2|1} = -0.389$.

For a four-variable regression we have

$$r_{y3|12} = \frac{r_{y3|1} - r_{y2|1} r_{32|1}}{\sqrt{(1 - r_{y2|1}^2)(1 - r_{32|1}^2)}}$$

and similarly for the other coefficients. In this formula, the three
coefficients $r_{hj|1}$ on the right must be calculated from the six simple coefficients $r_{hj}$.

Each partial correlation coefficient lies between $-1$ and $+1$. The
interpretation of the coefficient is analogous to that of the correlation coefficient
for two variables. Significance tests and confidence intervals can also be
constructed as in the case of two variables (Sec. 6.2.5c and d). In most
applications it is sufficient to calculate the partial regression coefficients,
the $b_j$, and not bother with the partial correlation coefficients.
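The three-variable formula of this subsection, and its repeated use for a four-variable regression, can be sketched as follows. This is an illustration only: the function names are hypothetical, and the six simple correlations fed to the second-order function are made-up values chosen merely to exercise the recursion.

```python
from math import sqrt


def partial_r(r_ab, r_ac, r_bc):
    """Correlation of a and b after removing the influence of c."""
    return (r_ab - r_ac * r_bc) / sqrt((1 - r_ac ** 2) * (1 - r_bc ** 2))


def partial_r_y3_given_12(r_y1, r_y2, r_y3, r_12, r_13, r_23):
    """r_{y3|12}, built from the three first-order coefficients r_{hj|1}."""
    r_y3_1 = partial_r(r_y3, r_y1, r_13)
    r_y2_1 = partial_r(r_y2, r_y1, r_12)
    r_32_1 = partial_r(r_23, r_13, r_12)
    return partial_r(r_y3_1, r_y2_1, r_32_1)


# Numerical case quoted in the text: r_y2 = 0.5, r_y1 = r_21 = 0.8.
r_y2_given_1 = partial_r(0.5, 0.8, 0.8)
print(round(r_y2_given_1, 3))

# Hypothetical simple correlations, for illustration only.
r_second_order = partial_r_y3_given_12(0.6, 0.5, 0.4, 0.3, 0.2, 0.1)
print(round(r_second_order, 3))
```

The first result shows a positive simple correlation turning negative once $x_1$ is held fixed, which is the point made in the text.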

6.2.5.  Reliability  of  Multiple  Regression  Measures 

As  in  simple  regression  (Sec.  6.1.4),  the  significance  tests  and  confidence 
intervals  given  below  are  exact  only  under  the  normality  assumptions 
discussed  in  Sec.  6.0.  Subsections  a,  e,  f,  and  g  are  appropriate  to  Type  I 
problems.  Numerical  examples  are  not  given  for  the  other  subsections, 
since  the  problem  of  Table  6.2  is  better  classified  as  Type  I  than  as  Type  II. 
All  of  this  section  may  be  applied  to  Type  II  problems,  but  Subsec.  b  and  c 
are  equivalent  to  special  cases  of  Subsec.  a  and  e,  respectively.  Problems 
of  mixed  type  may  arise  in  multiple  regression. 

a. Significance of Regression as a Whole. The regression plane
could fail to be statistically significant because (1) the assumed form of
the equation is not the true form (i.e., the true regression surface is not a
plane), or (2) the variation of observed points about the fitted plane,
though random, is so large that the fitted plane could have arisen by
random sampling from a population with all the partial regression
coefficients equal to zero.

We can apply the method of Sec. 6.1.6 to test the linearity of the
regression relation if several observations of $y$ are made for each
combination of values of $x_1, x_2, \ldots, x_k$.

We can test the hypothesis that all true partial regression coefficients
equal zero by an $F$ test of the variance accounted for by regression, relative
to the error variance $s_{y|12\cdots k}^2$

$$\begin{aligned}
F &= \frac{\bigl[b_1 \sum (x_{1i} - \bar{x}_1)(y_i - \bar{y}) + \cdots + b_k \sum (x_{ki} - \bar{x}_k)(y_i - \bar{y})\bigr] / k}{s_{y|12\cdots k}^2} \\
&= \frac{b_1 a_{1y} + \cdots + b_k a_{ky}}{k\, n\, s_{y|12\cdots k}^2}
\end{aligned}$$

with $k$ and $n - k - 1$ degrees of freedom. We compare the $F$ calculated
from the data with the critical value tabulated in Appendix Table 5, and
reject the hypothesis that all true partial regression coefficients equal zero
if the calculated $F$ is larger.

Example. Test at the 5% significance level whether the true partial
regression coefficients are zero in the example of Table 6.2.

We find from Sec. 6.2.3 that

$$F = \frac{31{,}481}{(3)(19)(107.81)} = 5.123$$

with 3 and 15 degrees of freedom, which is larger than the critical
value $F_{.05}(3, 15) = 3.29$ in Appendix Table 5. Hence we reject the
hypothesis, concluding that the variance accounted for by regression is
more than could reasonably be expected if all the true partial regression
coefficients were zero.
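A short sketch of the same test is given below for checking purposes. It is an illustration with assumed variable names; the critical value 3.29 is simply copied from the text (Appendix Table 5) and is not computed.

```python
n, k = 19, 3
a_yy = 62206
a_y = [-20311, 37867, 3173]
b = [-0.597290, 0.515068, -0.048583]

explained = sum(bj * ajy for bj, ajy in zip(b, a_y))   # b1*a1y + ... + bk*aky
s2_est = (a_yy - explained) / (n * (n - k - 1))        # error variance

F = explained / (k * n * s2_est)
F_CRITICAL = 3.29   # F_.05(3, 15), as quoted from Appendix Table 5

reject = F > F_CRITICAL
print(round(F, 3), reject)
```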

b. Significance of Multiple Correlation Coefficient. We can test
$r_{y|12\cdots k}$ for significance at the 1 or 5% level by comparison with the value
in Appendix Table 7 for $k + 1$ variables and $n - k - 1$ degrees of
freedom. We reject the null hypothesis that the population multiple correlation
coefficient is zero if $r_{y|12\cdots k}$ exceeds the tabled value. If we reject the null
hypothesis, we conclude that the regression of $y$ on $x_1, x_2, \ldots$, and $x_k$
accounts for a significant amount of the variation observed in $y$. This test
of significance of the multiple correlation coefficient is actually equivalent
to the $F$ test of Sec. 6.2.5a.

c. Significance of Partial Correlation Coefficients. We can test any
partial correlation coefficient, $r_{y1|23\cdots k}$ for instance, for significance at the
1 or 5% level by comparison with the value in Appendix Table 7 for
$n - k - 1$ degrees of freedom and two variables. If $|r_{y1|23\cdots k}|$ exceeds the
tabled value, we reject the null hypothesis that the population partial
correlation coefficient $\rho_{y1|23\cdots k}$ is zero. Note that, although we use the
table column for two variables (the variables $x_1$ and $y$, in this case), we
take care of the deletion of the other variables by using $n - k - 1$ degrees
of freedom. Thus, for $k = 3$ and $n = 19$ the critical value of $|r_{y1|23}|$ at the
5% significance level is 0.482.
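The tabled critical value can be cross-checked against the $t$ table. The sketch below relies on a standard relation that is not stated in this section and is therefore an assumption here: a correlation coefficient with $f$ degrees of freedom is tested through $t = r\sqrt{f}/\sqrt{1 - r^2}$, so the critical $r$ is $t/\sqrt{t^2 + f}$. The value 2.131 is the $t_{.025,15}$ quoted elsewhere in this chapter from Appendix Table 3.

```python
from math import sqrt


def critical_r(t_crit, dof):
    """Critical |r| for two variables, from the critical t with the same dof."""
    return t_crit / sqrt(t_crit ** 2 + dof)


n, k = 19, 3
dof = n - k - 1                 # 15 degrees of freedom
r_crit = critical_r(2.131, dof)
print(round(r_crit, 3))
```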

d. Confidence Intervals for Partial Correlation Coefficients. This
subsection, like the preceding two, applies only to Type II problems. To
construct a 95% confidence interval for the population partial correlation
coefficient, say $\rho_{y3|124\cdots k}$, enter Appendix Chart XI with the sample
coefficient $r_{y3|124\cdots k}$ and find the interval cut off by the curves labeled $n - k + 1$.
In 95% of the experiments so treated, the resulting confidence interval will
contain the population coefficient. Thus, if $k = 3$, $n = 19$, and $r_{y3|12} = 0.6$,
we interpolate for 17 between curves labeled 15 and 20 and get 95%
confidence limits of 0.16 and 0.83.

e. Significance Tests and Confidence Intervals for Partial
Regression Coefficients. The standard error of the partial regression coefficient
$b_j$ can be estimated from the sample as

$$s_{b_j} = s_{y|12\cdots k} \sqrt{n\, e_{jj}} \qquad (j = 1, 2, \ldots, k)$$

The values of the $e_{jj}$ are the diagonal elements of the inverse matrix of the
normal equations computed in Sec. 6.2.2.

To test the null hypothesis that the partial regression coefficient $\beta_j$ in
the population has any stated value $B$, compute

$$t = \frac{b_j - B}{s_{b_j}}$$

Reject the null hypothesis at the significance level $\alpha$ if $|t|$ exceeds the
critical value $t_{\alpha/2,\,n-k-1}$ given in Appendix Table 3 for $n - k - 1$ degrees
of freedom.

A $100(1 - \alpha)$% confidence interval for the population coefficient $\beta_j$ is

$$b_j \pm t_{\alpha/2,\,n-k-1}\, s_{b_j}$$

Example. In the continuing example, we have

$$\begin{aligned}
b_1 &= -0.597290 & e_{11} &= 27.7444 \times 10^{-6} & s_{y|123} &= 10.383 \\
b_2 &= 0.515068 & e_{22} &= 13.7631 \times 10^{-6} & n &= 19 \\
b_3 &= -0.048583 & e_{33} &= 58.5787 \times 10^{-6}
\end{aligned}$$

Hence

$$\begin{aligned}
s_{b_1} &= 10.383 \sqrt{5.2714 \times 10^{-4}} = 0.23839 \\
s_{b_2} &= 10.383 \sqrt{2.6150 \times 10^{-4}} = 0.16790 \\
s_{b_3} &= 10.383 \sqrt{11.1300 \times 10^{-4}} = 0.34639
\end{aligned}$$

To test at the 5% significance level the null hypothesis that a
population partial regression coefficient $\beta_j$ is zero, we compute $t_j = b_j / s_{b_j}$ and
compare its absolute value with $t_{.025,15} = 2.131$ from Appendix Table 3.
Since $b_3$ is much smaller numerically than its estimated standard error, it
is not significant (i.e., not significantly different from zero) whatever
reasonable significance level might have been chosen, but

$$t_1 = -2.506 \quad \text{and} \quad t_2 = 3.068$$

Hence we reject the hypotheses that $\beta_1 = 0$ and $\beta_2 = 0$.

We may construct 95% confidence limits for $\beta_1$ as

$$b_1 \pm 2.131\, s_{b_1} = -0.5973 \pm 0.5080 = -1.1053 \text{ and } -0.0893$$

Likewise 95% confidence limits for $\beta_2$ are

$$b_2 \pm 2.131\, s_{b_2} = 0.5151 \pm 0.3578 = 0.1573 \text{ and } 0.8729$$

Hence $\beta_1$ and $\beta_2$ are but roughly determined by the 19 observations
available.
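The standard errors, $t$ statistics, and confidence limits of this example follow directly from the formulas of Subsec. e. A minimal sketch, with assumed variable names and the critical value 2.131 copied from the text, is:

```python
from math import sqrt

n = 19
s_est = 10.383                      # standard error of estimate, Sec. 6.2.3
t_crit = 2.131                      # t_.025,15 from Appendix Table 3
b = [-0.597290, 0.515068, -0.048583]
e_diag = [27.7444e-6, 13.7631e-6, 58.5787e-6]   # e_11, e_22, e_33

# Standard error of each partial regression coefficient.
s_b = [s_est * sqrt(n * e_jj) for e_jj in e_diag]

# t statistics for the hypothesis beta_j = 0, and the 5% decisions.
t = [bj / sb for bj, sb in zip(b, s_b)]
significant = [abs(tj) > t_crit for tj in t]

# 95% confidence limits b_j +/- t * s_bj.
limits = [(bj - t_crit * sb, bj + t_crit * sb) for bj, sb in zip(b, s_b)]

for j in range(3):
    print(j + 1, round(s_b[j], 5), round(t[j], 3), significant[j],
          tuple(round(v, 4) for v in limits[j]))
```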

In general, when a partial regression coefficient, say $b_h$, is not statistically
significant, we may choose one of three courses of action:

1. Retain the term $b_h (x_h - \bar{x}_h)$ in the regression equation

2. Calculate a new regression equation, preferably on the basis of a
further sample, omitting observations on $x_h$ entirely

3. Discard the term and use the regression equation with the other terms
already calculated

The first course is the safest, but it may be unnecessarily complicating when,
as in the example below, the term can be discarded on the basis of its
negligible practical importance. If a term is discarded, course 2 is the proper
one in general, in order that the resulting reduced equation be a least-squares
estimate of the (reduced) population regression plane. If the experiment is
designed with such symmetry that the off-diagonal coefficients $a_{hj}$ ($h \neq j$)
of the normal equations are zero, following course 3 is equivalent to
following course 2 without drawing a new sample. Also, course 3 is
sometimes justified as an approximation to course 1 when, as in the example,
the differences between the $y'$ values as predicted by the resulting reduced
equation and by the equation using all the variables are negligible.

In any case the choice of equation is aided by comparison of the sizes
of the standard errors of estimate $s_{y|12\cdots k}$ of the various equations
considered. In an extreme case the regression as a whole may be significant
according to Sec. 6.2.5a, although no $b_j$ is significant individually. It may
happen in such a case that some subset of the original set of independent
variables yields practically as useful a regression as the original. This is
exemplified in Ref. 6.7, p. 646, together with a method of finding the
appropriate subset of independent variables.

Example. In the previous example, we found that two regression
coefficients, $b_1$ and $b_2$, were significant, but the third, $b_3$, was not
significant. What would the practical effect in estimating $y$ be if we were to
discard the term in $x_3$?

Over the range of applicability of the regression, the maximum
error introduced by discarding the term $b_3 (x_3 - \bar{x}_3)$ would be
$(0.0486)(10.05)(10^{-3}) = 0.00049$ inch. Therefore, we should follow
course 3 of the explanation above, discarding the term because its
contribution is not only not significant statistically but is also negligible
numerically.

f. Confidence Interval for Ordinate to the True Regression Plane.
Confidence limits for the true mean $y$ value (not for an individual predicted
$y$) at the $x$ point $(X_1, X_2, \ldots, X_k)$ are

$$y' \pm t_{\alpha/2,\,n-k-1}\, s_{y|12\cdots k} \sqrt{\frac{1}{n} + n \sum_{h,j=1}^{k} e_{hj} (X_h - \bar{x}_h)(X_j - \bar{x}_j)}$$

where $y'$ is calculated from the regression equation. The $x$ point need not
be one of those used in finding the regression plane. Note that there are $k^2$
terms in the summation, $k$ squares plus two equal cross-products for each
pair $(h, j)$.

This confidence interval can be particularized to cover the case of the $y$
intercept by taking $(X_1, X_2, \ldots, X_k) = (0, 0, \ldots, 0)$ or the case of
$\bar{y}$ by taking $(X_1, X_2, \ldots, X_k) = (\bar{x}_1, \bar{x}_2, \ldots, \bar{x}_k)$. Significance tests
based on the $t$ statistic can also be made.

Example. For the data of Table 6.2, calculate 95% confidence limits
for the mean $y$ value ($y' = 118.51$ or $(\text{ID})' = 1.2185$ inches) estimated
in Sec. 6.2.2 for $x_1 = 11$, $x_2 = 58$, and $x_3 = 11$.

Using the values of $\bar{x}_j$ in Table 6.3 and the $e_{hj}$ calculated in Sec. 6.2.2,
we have

$$\begin{aligned}
n \sum_{h,j=1}^{3} e_{hj} (X_h - \bar{x}_h)(X_j - \bar{x}_j)
= 19 \times 10^{-6} \bigl[ &27.74 (-8.89)^2 + 13.76 (19.05)^2 + 58.58 (-7.95)^2 \\
&+ 2 (-0.78)(-8.89)(19.05) + 2 (-1.34)(-8.89)(-7.95) \\
&+ 2 (-6.91)(19.05)(-7.95) \bigr] = 0.2481
\end{aligned}$$

Hence the confidence limits are

$$118.51 \pm (2.131)(10.383)(0.5484) = 118.51 \pm 12.13 = 106.38 \text{ and } 130.64$$

or 1.2064 and 1.2306 inches, in terms of the original scale for inside
diameter.
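The quadratic form under the radical is the part of this calculation most prone to slips, so a machine check may be useful. The sketch below is illustrative only; it uses the rounded means of Table 6.3 and the $e_{hj}$ of Sec. 6.2.2, and the variable names are assumptions.

```python
from math import sqrt

n, k = 19, 3
s_est = 10.383        # standard error of estimate (coded units)
t_crit = 2.131        # t_.025,15 from Appendix Table 3
y_pred = 118.51       # y' at the point of interest

# Inverse-matrix elements e_hj, in units of 1e-6.
e = [[27.7444, -0.7794, -1.3423],
     [-0.7794, 13.7631, -6.9115],
     [-1.3423, -6.9115, 58.5787]]

X = (11, 58, 11)
x_bar = (19.89, 38.95, 18.95)
d = [X[j] - x_bar[j] for j in range(k)]

# n * sum over all k*k pairs (h, j) of e_hj * d_h * d_j
q = n * 1e-6 * sum(e[h][j] * d[h] * d[j] for h in range(k) for j in range(k))

root = sqrt(1 / n + q)
half_width = t_crit * s_est * root
lower, upper = y_pred - half_width, y_pred + half_width

print(round(q, 4), round(root, 4), round(lower, 2), round(upper, 2))
```

Summing over all $k^2$ ordered pairs automatically counts each cross-product twice, as the text requires.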

g. Prediction Interval for Individual Value of y. The appropriate
discussion here is the same as in Sec. 6.1.4f. A $100(1 - \alpha)$% prediction
interval for an individual value of $y$ for the $x$ point $(X_1, X_2, \ldots, X_k)$
differs from the $100(1 - \alpha)$% confidence interval for the ordinate to the
true regression plane given in Sec. 6.2.5f only in replacing the first term
under the radical, $1/n$, by $1 + 1/n$.

Example. For the data of Table 6.2, calculate a 95% prediction
interval for an individual value of $y$ for $x_1 = 11$, $x_2 = 58$, and $x_3 = 11$.

The calculation involves the summation evaluated in the example in
Sec. 6.2.5f. The prediction limits are

$$118.51 \pm (2.131)(10.383)(1.1405) = 118.51 \pm 25.23 = 93.28 \text{ and } 143.74$$

or 1.1933 and 1.2437 inches, in terms of the original scale.
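For comparison with the confidence interval for the mean, the prediction limits can be recomputed in a few lines. This sketch is illustrative; it takes the value 0.2481 of the summation as given in the example of Sec. 6.2.5f, and the helper name `to_inches` is an assumption.

```python
from math import sqrt

n = 19
s_est = 10.383
t_crit = 2.131
y_pred = 118.51
q = 0.2481            # n * sum of e_hj (X_h - xbar_h)(X_j - xbar_j), Sec. 6.2.5f


def to_inches(y_coded):
    """Undo the coding of Table 6.2."""
    return y_coded / 1000 + 1.1


# The only change from the confidence interval: 1/n becomes 1 + 1/n.
root = sqrt(1 + 1 / n + q)
half_width = t_crit * s_est * root
lower, upper = y_pred - half_width, y_pred + half_width

print(round(root, 4), round(lower, 2), round(upper, 2))
print(round(to_inches(lower), 4), round(to_inches(upper), 4))
```

The interval is roughly twice as wide as the one for the mean because it must also cover the scatter of a single observation about the plane.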

6.3. Non-Linear Regression

This section applies to Type I problems but not to Type II problems (Sec.
6.0). As noted in Sec. 6.0, the method generally used for fitting a line,
curve, or surface is the method of least squares. If, except for random
fluctuation in $y$ (which need not be normally distributed), $y$ is related to
variables $x_1, x_2, \ldots, x_k$ by a function $g(x_1, x_2, \ldots, x_k; c_1, c_2, \ldots, c_m)$
of given form with unknown parameters $c_1, c_2, \ldots, c_m$, then these
parameters may be estimated from observations $x_{1i}, x_{2i}, \ldots, x_{ki}, y_i$
(where $i = 1, 2, \ldots, n$) which have known "weights" $w_i$ by minimizing
the weighted sum of squared deviations

$$S(c_1, c_2, \ldots, c_m) = \sum_{i=1}^{n} w_i \bigl[y_i - g(x_{1i}, x_{2i}, \ldots, x_{ki}; c_1, c_2, \ldots, c_m)\bigr]^2$$

with respect to $c_1, c_2, \ldots, c_m$. (Throughout this Manual all $w_i = 1$.)

This minimization is generally done by differentiating $S(c_1, c_2, \ldots, c_m)$
with respect to each $c_j$ and equating to zero, thus getting $m$ equations for
estimating $c_1, c_2, \ldots, c_m$.

Many non-linear relationships can be analyzed by transforming the
original variables into new ones that are related linearly. Thus, to fit a
cubic equation

$$y' = a + b_1 x + b_2 x^2 + b_3 x^3$$

we let $x_1 = x$, $x_2 = x^2$, and $x_3 = x^3$. With these meanings for $x_1$, $x_2$, and
$x_3$, the formulas for multiple regression in Sec. 6.2 may be used immediately
for non-linear regression. The intercept on the y axis is estimated as

$$a = \bar{y} - b_1 \bar{x} - b_2 \overline{x^2} - b_3 \overline{x^3}$$

where $\overline{x^2}$ is the mean of the squares of the observed values of x and $\overline{x^3}$ is
the mean of the cubes of the observed values. We note that $x_1$, $x_2$, and $x_3$
are certainly not statistically independent here, but independence is not
necessary in multiple regression analysis.
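As an illustration of the substitution $x_1 = x$, $x_2 = x^2$, $x_3 = x^3$, the sketch below fits a cubic by ordinary multiple regression on the three power columns and then recovers the intercept from the means. The data are hypothetical (an exact cubic without random fluctuation), and NumPy's least-squares solver stands in for the hand solution of the normal equations of Sec. 6.2.

```python
import numpy as np

# Hypothetical data generated from a known cubic, so the fit can be checked.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = 2.0 + 0.5 * x - 0.3 * x**2 + 0.04 * x**3

# Treat the powers of x as three separate regressors x1, x2, x3.
X = np.column_stack([x, x**2, x**3])

# Regression on deviations from the means, as in multiple regression.
Xc = X - X.mean(axis=0)
yc = y - y.mean()
b, *_ = np.linalg.lstsq(Xc, yc, rcond=None)

# Intercept: a = ybar - b1*mean(x) - b2*mean(x^2) - b3*mean(x^3)
a = y.mean() - b @ X.mean(axis=0)

print("b =", np.round(b, 4), " a =", round(float(a), 4))
```

Note that the intercept uses the means of the squares and cubes, not the square and cube of the mean.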


The methods detailed in Sec. 6.1 and 6.2 can be used whenever
$y' = g(x_1, x_2, \cdots, x_k; c_1, c_2, \cdots, c_m)$ is linear in the parameters to be
estimated. For example, the equation

/  =  a  +  bjfzx  +  b2zx  log  z2  +  b3/z\ 

can  be  fitted,  using  the  formulas  of  Sec.  6.2,  by  letting  xx  =  V  z1 , 
x2  —  zx  log  z2,  and  x3  —  1  /z\.  The  y  intercept  is  estimated  as 

a  =  y~  bjfzx  -  Mi  log  “  h  JJM) 

In particular, the test of a hypothetical form of an equation in Sec. 6.1.6
is available if, as is recommended, several observations on y can be made
for each value of $x_1, x_2, \cdots, x_k$, and if the random fluctuations in y
are normally distributed.

Sometimes a transformation of $y'$ as well as x (or the $x_j$ in multiple
regression) may render $y'$ linear in the parameters. For example, $y' x^b = c$
becomes

$$\log y' = \log c - b \log x$$

by taking logarithms. By letting $Y' = \log y'$, $C = \log c$, and $X = \log x$,
the problem is reduced to the linear case of Sec. 6.1. Other transformations
of value are

$$X = e^{ax} \quad \text{and} \quad X = x^a$$

The latter transformation includes $X = 1/x^6$ and $X = \sqrt{x}$, since a can be
any known non-zero constant. Similar changes of variable may be made
on $y'$.

If a transformation of $y'$ is used, say to $\log y'$, the fitted line or plane
minimizes the sum of squares of deviations of log y rather than of y; if y
is normally distributed, log y will not be. Although in this case the
theoretically best estimate of the regression curve of y on the x's is not
obtained, the approximation is often sufficiently close. In case the regression
function cannot be transformed so that the unknown coefficients are
involved linearly, the least-squares equations may be insoluble except by
a method of successive approximations. (See Ref. 6.12 and also Ref. 6.7,
pp. 558-70 and 649-57, for further discussion of the fitting of non-linear
equations.)
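The logarithmic transformation of $y' x^b = c$ can be sketched as follows. The data are hypothetical and free of random fluctuation, and `numpy.polyfit` is used only as a convenient straight-line fitter in place of the formulas of Sec. 6.1. As the text cautions, with real data this minimizes squared deviations of log y, not of y.

```python
import numpy as np

# Hypothetical observations obeying y * x**b = c exactly, with c = 50 and b = 1.5.
x = np.array([1.0, 2.0, 4.0, 8.0, 16.0])
y = 50.0 / x**1.5

# Transform: Y' = log y, X = log x, so that Y' = C - b X with C = log c.
X = np.log10(x)
Y = np.log10(y)

# Straight-line fit in the transformed variables (returns slope, then intercept).
slope, intercept = np.polyfit(X, Y, 1)

b_hat = -slope            # the slope of the line is -b
c_hat = 10.0**intercept   # the intercept of the line is log c

print(f"b = {b_hat:.4f}, c = {c_hat:.4f}")
```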
6.3.1. Polynomial Equations

For  curvilinear  relationships,  a  polynomial  is  frequently  fitted  unless 
there  is  some  theoretical  basis  for  using  a  different  curve.  The  polynomial 
equation  is 

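$$y' = \bar{y} + b_1 (x - \bar{x}) + b_2 (x^2 - \overline{x^2}) + \cdots + b_k (x^k - \overline{x^k})$$

where

$$\overline{x^j} = \frac{1}{n} \sum_{i=1}^{n} x_i^j \qquad (j = 1, 2, \cdots, k)$$

The normal equations of Sec. 6.2.2 for determining the coefficients hold
as before

$$a_{11} b_1 + a_{12} b_2 + \cdots + a_{1k} b_k = a_{1y}$$

$$a_{21} b_1 + a_{22} b_2 + \cdots + a_{2k} b_k = a_{2y}$$

$$\cdots$$

$$a_{k1} b_1 + a_{k2} b_2 + \cdots + a_{kk} b_k = a_{ky}$$

where now

$$a_{hj} = n \sum x_i^{h+j} - \sum x_i^h \sum x_i^j$$

$$a_{jy} = n \sum x_i^j y_i - \sum x_i^j \sum y_i$$

As in Sec. 6.2.2, the equations may be solved for the $b_j$ in the easiest
available way. All the methods of Sec. 6.2 for a Type I model can be
applied directly to the polynomial equation and need not be repeated here.
In particular, the variance about the regression curve is defined as

$$s_{y|x}^2 = \frac{\sum_{i=1}^{n} (y_i - y'_i)^2}{n - k - 1}$$

and is calculated as

$$s_{y|x}^2 = \frac{1}{n(n - k - 1)} \left( a_{yy} - b_1 a_{1y} - b_2 a_{2y} - \cdots - b_k a_{ky} \right)$$

where

$$a_{yy} = n \sum y_i^2 - \left( \sum y_i \right)^2$$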
Since the present situation is simply that of multiple linear regression
with $x_j$ replaced by $x^j$, the significance of any regression coefficient can
be tested by the method of Sec. 6.2.5e. If there is uncertainty about the
degree of equation appropriate for the data, it is desirable to start with an
equation of the highest possible degree of interest, and subsequently to
discard those terms which contribute little to the total estimate $y'$ and have
coefficients that are not significant. In this situation the use of orthogonal
polynomials (described in the next section) is convenient. The method does
not require further calculation for the reduced equation; however, its
application is practical only if the values of x are equally spaced.

6.3.2. Orthogonal Polynomials, When x Is Equally Spaced

When an ordinary polynomial is to be fitted (Sec. 6.3.1), all the
coefficients must be recalculated each time a power term is added or
discarded. With orthogonal polynomials an analysis of variance is
performed to apportion the variance among terms of the polynomial. The
contribution of each term is tested; if it is not found significant, the term
may be discarded without recalculating the previously obtained coefficients.
For fitting curves of higher degree, this method saves time. Furthermore,
the tests of significance of effects are isolated, as in the simpler cases of
analysis of variance.

The  orthogonal  polynomials  themselves  depend  on  the  arrangement  of 
the  values  of  the  independent  variable  x.  For  the  commonly  tabulated 
orthogonal  polynomials  used  here,  the  values  of  x  must  be  equally  spaced. 
The method yields an equation

$$y' = \bar{y} + A_1 \xi'_1(x) + A_2 \xi'_2(x) + \cdots + A_k \xi'_k(x)$$

where the $A_j$ are coefficients to be determined, and the $\xi'_j(x)$ are known
polynomials in x, orthogonal in the sense that the products $\xi'_h(x_i)\,\xi'_j(x_i)$ (with
$h \neq j$) over n equally spaced values of x sum to zero.

In order to tabulate the values of the orthogonal polynomials
conveniently for repeated use, the values of x (say $x_i$) are taken to be one
unit apart, and $\xi'_j(x)$ is taken as a multiple $\lambda_j$ of a corresponding
polynomial $\xi_j(x)$ with leading coefficient unity. The latter adjustment makes
all the tabulated values $\xi'_j(x_i)$ integers. Thus, in particular, we have

$$\xi'_1(x) = \lambda_1 \xi_1(x) = \lambda_1 (x - \bar{x})$$

$$\xi'_2(x) = \lambda_2 \xi_2(x) = \lambda_2 \left[ (x - \bar{x})^2 - \frac{n^2 - 1}{12} \right]$$

$$\xi'_3(x) = \lambda_3 \xi_3(x) = \lambda_3 \left[ (x - \bar{x})^3 - (x - \bar{x}) \frac{3n^2 - 7}{20} \right]$$

$$\xi'_4(x) = \lambda_4 \xi_4(x) = \lambda_4 \left[ (x - \bar{x})^4 - (x - \bar{x})^2 \frac{3n^2 - 13}{14} + \frac{3(n^2 - 1)(n^2 - 9)}{560} \right]$$

and in general $\xi'_r(x) = \lambda_r \xi_r(x)$ with the recursion formula

$$\xi_{r+1} = \xi_1 \xi_r - \frac{r^2 (n^2 - r^2)}{4(4r^2 - 1)}\, \xi_{r-1}$$

where

$$\xi_0 = 1 \quad \text{and} \quad \xi_1 = x - \bar{x}$$

The coefficients $A_j$ of a fitted polynomial are calculated from the
observations $y_i$, using the formula

$$A_j = \frac{\sum y_i\, \xi'_j(x_i)}{\sum \xi'^2_j(x_i)}$$

The values of the individual $\xi'_j(x_i)$ and the sums of squares $\sum \xi'^2_j(x_i)$ are
given in tables of orthogonal polynomials. Appendix Table 15, which is
an excerpt from a more extensive table given in Ref. 5.7, illustrates the
arrangement of that table and suffices for the example below; the complete
table of Ref. 5.7 would have to be consulted in general. The tables also give
values of the $\lambda_j$, which depend on n; with these $y'$ can be expressed directly
in powers of x.

Example.  From  the  data  in  columns  2  and  3  of  Table  6.4,  fit  a 
polynomial  curve.  The  values  of  the  independent  variable  x  are  coded 
one  unit  apart  in  column  2  of  the  table.  It  is  assumed  that  terms  of 
degree  higher  than  four  are  known  from  theory  not  to  be  present  or  are 
certain  to  be  of  negligible  interest. 

Columns 4-7 and the values of $\sum \xi'^2_j$ and $\lambda_j$ are taken from the table
of orthogonal polynomials, Appendix Table 15, for n = 15. The values
for x ≥ 8 are given directly in that table. The values for x < 8 are the
same (symmetrically about $\bar{x}$) for $\xi'_2$ and $\xi'_4$ and the same except for
change of sign for $\xi'_1$ and $\xi'_3$.

TABLE 6.4. Example of Use of Orthogonal Polynomials

        x_i
 Original   Coded     y_i     ξ'_1     ξ'_2     ξ'_3       ξ'_4
   (1)       (2)      (3)      (4)      (5)      (6)        (7)

   45.2       1       7.8      -7       91      -91       1,001
   45.4       2       7.9      -6       52      -13        -429
   45.6       3       8.8      -5       19       35        -869
   45.8       4       9.3      -4       -8       58        -704
   46.0       5       8.7      -3      -29       61        -249
   46.2       6       8.4      -2      -44       49         251
   46.4       7       7.8      -1      -53       27         621
   46.6       8       7.2       0      -56        0         756
   46.8       9       6.7       1      -53      -27         621
   47.0      10       6.4       2      -44      -49         251
   47.2      11       5.7       3      -29      -61        -249
   47.4      12       4.6       4       -8      -58        -704
   47.6      13       3.7       5       19      -35        -869
   47.8      14       3.0       6       52       13        -429
   48.0      15       2.0       7       91       91       1,001

 Σ y_i               98.0
 Σ y_i ξ'_j                 -128.4   -655.6    170.3      -937.5
 Σ ξ'_j^2                     280    37,128   39,780   6,466,460
 λ_j                            1        3      5/6       35/12

The sums of products $\sum y_i\, \xi'_j(x_i)$ must be
calculated and are given at the bottom of Table 6.4. The coefficients in
the regression equation are then obtained immediately as

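$$\bar{y} = 6.5333$$

$$A_1 = \frac{-128.4}{280} = -0.45857 \qquad A_3 = \frac{170.3}{39{,}780} = 0.0042810$$

$$A_2 = \frac{-655.6}{37{,}128} = -0.017658 \qquad A_4 = \frac{-937.5}{6{,}466{,}460} = -0.00014498$$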

An analysis of variance (Chapter 5) is performed to test the
significance of the terms $A_j \xi'_j(x)$. We first compute the sum of squared
deviations of y from $\bar{y}$, which is the total sum of squares; then in turn
we compute that part of the total sum of squares accounted for by the
linear, quadratic, cubic, and quartic regressions. These sums of squares,
their degrees of freedom, and the corresponding mean squares appear
in Table 6.5. The sum of squares accounted for by the jth degree term
$A_j \xi'_j(x)$ is calculated as

$$\frac{\left[ \sum y_i\, \xi'_j(x_i) \right]^2}{\sum \xi'^2_j(x_i)}$$

The residual sum of squares, that part of the total not accounted for at
all except as random fluctuation, is obtained by subtraction as shown in
the table.

In testing the jth degree polynomial term for significance, we are
testing the null hypothesis that the population has a polynomial
regression with the true value of the coefficient $A_j$ equal to zero. Rejection of
this hypothesis implies that the data are significantly better represented
by a polynomial which includes $\xi'_j(x)$ than by a polynomial which does
not. (It is possible, though perhaps not likely, that we might reject, for
example, the hypothesis that $A_4 = 0$ and not reject the hypothesis that
$A_2 = 0$.) We compare the F values with $F_{.05}(1, 10) = 4.96$ from
Appendix Table 5, and we see that all terms but the quartic are
significant. Therefore we use only the first three in the equation

$$y' = \bar{y} + A_1 \xi'_1(x) + A_2 \xi'_2(x) + A_3 \xi'_3(x)$$

With n = 15, $\lambda_1 = 1$, $\lambda_2 = 3$, and $\lambda_3 = 5/6$ from Table 6.4 we obtain

$$\xi'_1(x) = x - 8$$

$$\xi'_2(x) = 3x^2 - 48x + 136$$

$$\xi'_3(x) = \tfrac{1}{6}(5x^3 - 120x^2 + 793x - 1{,}224)$$

Substitution of these and the values for $\bar{y}$, $A_1$, $A_2$, and $A_3$ in the equation
for $y'$ yields

$$y' = 0.00357x^3 - 0.1386x^2 + 0.955x + 6.93$$

where x is the coded variable. The equation is plotted in Fig. 6.5 along
with the original data.
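The whole orthogonal-polynomial calculation for this example can be retraced in a few lines. The sketch below is illustrative rather than a substitute for the tables: it regenerates the integer values $\xi'_j(x_i)$ for n = 15 from the recursion formula and the multipliers $\lambda_j$ quoted with Table 6.4, then forms the coefficients, the sums of squares for each term, and the F ratios against the residual mean square with 10 degrees of freedom. Variable names are assumptions of the sketch.

```python
from fractions import Fraction

# y values from column 3 of Table 6.4 (coded x = 1, 2, ..., 15)
y = [7.8, 7.9, 8.8, 9.3, 8.7, 8.4, 7.8, 7.2, 6.7, 6.4, 5.7, 4.6, 3.7, 3.0, 2.0]
n = len(y)
lam = [Fraction(1), Fraction(3), Fraction(5, 6), Fraction(35, 12)]  # lambda_1..4 for n = 15
xbar = Fraction(n + 1, 2)

def xi_values(x):
    """xi_1(x)..xi_4(x) with leading coefficient 1, from the recursion formula."""
    prev, cur = Fraction(1), x - xbar          # xi_0 and xi_1
    out = [cur]
    for r in range(1, 4):
        nxt = (x - xbar) * cur - Fraction(r * r * (n * n - r * r), 4 * (4 * r * r - 1)) * prev
        prev, cur = cur, nxt
        out.append(cur)
    return out

# Integer table of xi'_j(x_i) = lambda_j * xi_j(x_i)
table = [[int(l * v) for l, v in zip(lam, xi_values(Fraction(x)))] for x in range(1, n + 1)]

ybar = sum(y) / n
total_ss = sum((v - ybar) ** 2 for v in y)
sums_sq = [sum(row[j] ** 2 for row in table) for j in range(4)]
sums_prod = [sum(v * row[j] for v, row in zip(y, table)) for j in range(4)]

A = [sp / ss for sp, ss in zip(sums_prod, sums_sq)]             # coefficients A_j
term_ss = [sp ** 2 / ss for sp, ss in zip(sums_prod, sums_sq)]  # SS for each degree
resid_ms = (total_ss - sum(term_ss)) / (n - 5)                  # 10 degrees of freedom
F = [t / resid_ms for t in term_ss]

for name, a, f in zip(["linear", "quadratic", "cubic", "quartic"], A, F):
    print(f"{name:9s}  A = {a: .6g}   F = {f:.2f}")
```

Because the terms are orthogonal, dropping the quartic term leaves $A_1$, $A_2$, and $A_3$ unchanged, which is the practical advantage described in the text.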


FIG. 6.5. Cubic Fitted to Data of Table 6.4.


6.4. Planning the Experiment for Fitting a Function

The general steps in designing and running an experiment outlined in
Sec. 5.1, as well as much of the discussion of Sec. 5.0, apply to the fitting
of functions. In particular, we should use one of the following procedures:
(1) hold background variables constant at the values of interest (thus
limiting the applicability of the results), (2) provide for including
background conditions as independent variables in the analysis by controlling
or measuring them, or (3) randomize the effects of background conditions
on the observed variables. For instance, in estimating the regression
function of y on one independent variable x, it would be poor technique to
begin with the smallest value of x and proceed to successively larger values,
because other variables affecting y (such as temperature or instrument
calibration) may change with time in a systematic manner. It would be better
to randomize the order of application of the values of x by consulting a
random-number table (Ref. 1.7).
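Where a computer is at hand, a pseudo-random generator can take the place of the random-number table. The sketch below is one way to draw a run order; the x settings and the seed are hypothetical, and the seed is fixed only so that the run sheet can be reproduced.

```python
import random

# Hypothetical settings of the independent variable x, in increasing order.
levels = [10, 20, 30, 40, 50, 60]

# A seeded generator makes the randomized run sheet reproducible.
rng = random.Random(1948)

# Draw the levels without replacement to get the order of application.
run_order = rng.sample(levels, k=len(levels))

for run, x in enumerate(run_order, start=1):
    print(f"run {run}: x = {x}")
```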

In a Type I problem the experimenter chooses the values of x (or of
$x_1, x_2, \cdots, x_k$) for which to observe y. If he does not know the form of
the relation between y and x, he should choose the x values at a number
of equidistant points over the range of interest. If he knows from theory
or past experience that the relation is expressible as a polynomial in x of
degree no higher than k, he can minimize the largest variance of the
ordinate to the fitted curve over the range of interest by choosing the x
values in k + 1 equal groups, spaced over the range. For example, if the
curve is known beforehand to be a straight line (that is, a polynomial of
degree one), the most precise estimates of ordinates to the line are obtained
by taking half the observations at one end of the range and half at the
other. This gives no test of linearity; and, in general, the number of groups
of x's should not be restricted to k + 1 unless terms of degree higher than
k are certainly negligible.

In the case of Type I multiple regression, the values of the variables $x_j$
should be chosen in a symmetric arrangement. Symmetry of some kind is
valuable not only for obtaining the maximum information per observation
from the experiment but also for simplifying the calculations. Symmetry
leads to simple normal equations in which each of the unknowns $b_j$ occurs
in only one of the normal equations so that it can be solved for immediately.
Furthermore, the t test of significance of each $b_j$ will be independent of
effects of the other $x_j$ variables; this desirable result is similar to that found
in fitting by use of orthogonal polynomials (Sec. 6.3.2), and in the analysis
of variance of an experiment designed with appropriate symmetry
(Chapter 5).

In the case of Type II problems it may happen that a large number of
independent variables are present and observable, so that the question may
arise as to whether they should all be observed and included in the analysis.
Inclusion of a further independent variable in the analysis will always
decrease the sum of squared deviations of the data from the fitted regression
function. However, if the additional variable actually has little effect on
the dependent variable, its inclusion may not decrease the mean square
deviation, since the number of degrees of freedom is decreased also. The
result is that the experiment is less sensitive; that is, all F ratios are decreased
and their critical values are increased.

Moreover, if regression analysis is being applied to a Type II problem,
two variables $x_j$ may be highly correlated; if so, one of them should be
omitted from the regression equation because the two regression coefficients
would be poorly determined, whereas one by itself can be well determined.
An extreme example might occur in a regression on percentage
composition of some measurable characteristic of a solid propellant. Each of the
components might conceivably be taken as an $x_j$, but if the percentage of
one were determined as a difference between 100 and the sum of the other
percentages, it would be linearly dependent on the others, the determinant
of the normal equations would be zero, and the regression coefficients could
not be determined. One of the components should therefore be omitted
from the regression equation.
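The percentage-composition case can be demonstrated numerically. In the sketch below the three component percentages are hypothetical and sum to 100 in every observation; the coefficient matrix of the normal equations is formed as $a_{hj} = n \sum x_h x_j - \sum x_h \sum x_j$, and its rank is examined with NumPy (the tolerance is an assumption chosen to absorb rounding error).

```python
import numpy as np

# Hypothetical propellant compositions: three components, each row sums to 100.
comp = np.array([
    [60.0, 25.0, 15.0],
    [55.0, 30.0, 15.0],
    [50.0, 30.0, 20.0],
    [65.0, 20.0, 15.0],
    [58.0, 27.0, 15.0],
    [52.0, 33.0, 15.0],
])

def normal_matrix(X):
    """Coefficient matrix a_hj = n*sum(x_h*x_j) - sum(x_h)*sum(x_j)."""
    n = X.shape[0]
    s = X.sum(axis=0)
    return n * X.T @ X - np.outer(s, s)

a_full = normal_matrix(comp)            # all three components as regressors
a_reduced = normal_matrix(comp[:, :2])  # one component omitted

rank_full = np.linalg.matrix_rank(a_full, tol=1e-6)
rank_reduced = np.linalg.matrix_rank(a_reduced, tol=1e-6)

print("rank with all components:", rank_full, "of", a_full.shape[0])
print("rank with one omitted:   ", rank_reduced, "of", a_reduced.shape[0])
```

With all three components the matrix is rank-deficient, so the normal equations have no unique solution; omitting one component restores full rank.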

It is important to take a sufficiently large sample to determine the
regression function with the desired precision, but it is somewhat complicated
to predict the sample size. In addition to the form of the regression
equation, one must know approximately the population standard deviation $\sigma_{y|x}$
about the regression function and the spacing of x values to be used. For
example, with a straight line and the values of x equally distributed at the
two end points of the range of x, the first equation of Sec. 6.1.4c indicates
that the standard deviation of the estimate of the regression coefficient is

$$\sigma_b = \frac{\sigma_{y|x}}{L \sqrt{n}}$$

where 2L is the total range of x. Knowing $\sigma_{y|x}$, L, and the desired precision
$\sigma_b$ in estimating the slope, we can solve this equation for the number n of
observations.
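For the straight-line case just described, with half the observations at each end of a range of total width 2L, the sum of squared deviations of x from its mean is $nL^2$, so the slope has standard deviation $\sigma_{y|x} / (L\sqrt{n})$ and the required n is $(\sigma_{y|x} / (L \sigma_b))^2$. The sketch below solves for n under those assumptions; the function name and the numerical inputs are hypothetical.

```python
import math

def observations_needed(sigma_yx, half_range, sigma_b):
    """Smallest even n giving slope standard deviation <= sigma_b.

    Assumes a straight line with n/2 observations at each end of a range
    of total width 2 * half_range, so that sum (x - xbar)^2 = n * half_range^2.
    """
    n = math.ceil((sigma_yx / (half_range * sigma_b)) ** 2)
    return n + (n % 2)   # round up to an even number so the two ends are equal

# Hypothetical planning values: sigma_y|x = 3.0, L = 4.0, desired sigma_b = 0.25
n_required = observations_needed(3.0, 4.0, 0.25)
print("observations required:", n_required)
```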

The  design  of  experiments  for  regression  is  treated  in  Ref.  6.7,  especially 
on  pages  536  and  632-34. 
Chapter 7

QUALITY-CONTROL  CHARTS 


7.0.  Introduction 

Quality-control  charts  are  a  graphical  means  for  detecting  systematic 
variation  from  the  quality  to  be  expected  in  a  continuous  production  line; 
that  is,  variation  greater  than  the  random  fluctuation  which  is  inevitable 
and  allowable.  The  charts  may  be  used  as  a  simple  method  of  performing 
statistical  tests  (such  as  those  in  Chapters  2-5)  of  the  hypothesis  that 
subsequently produced articles have essentially the same quality characteristics
as previously produced articles. Quality-control charts relate to a
production  line  or  process  in  action  and  indicate  when  a  process  should 
be  examined  for  trouble;  whereas  acceptance  sampling  (Chapter  8)  relates 
to  entire  lots  of  product,  and  enables  a  buyer  to  decide  on  the  basis  of  a 
sample  whether  to  accept  a  given  lot  from  a  supplier. 

Both  quality-control  and  acceptance  inspection  procedures  are  based  on 
judgment  of  the  whole  by  a  part;  that  is,  they  are  based  on  sampling.  These 
modern techniques have been found preferable to screening (100% inspection)
in most cases because of lower cost. The work load in screening
sometimes  requires  each  inspection  to  be  superficial  or  haphazard,  so  that 
a  sampling  plan  involving  careful  inspection  may  actually  give  more  reliable 
information  about  a  whole  process.  Also,  if  the  test  destroys  the  item 
inspected  (a  flash  bulb,  for  instance),  screening  is  out  of  the  question. 

Quality-control charts may be used to keep a process up to a specified
standard or simply to keep a process stable. A separate control chart should
be used for each part produced or each different operation intended to
produce different results. For instance, separate charts would be used for
the operation of cutting stock into lengths for fins and for the subsequent
shaping operation. To spot flaws close to their source, a sample for each
control chart should be taken whenever there has been a possibility of
change in the process, as when the shift of workers has changed or the
pressure has been increased. Thus, the flow of product is divided into
sampling units called rational subgroups, chosen so that a change between
two subgroups may give an indication of its cause.

In both quality-control work and acceptance sampling, a product may be
judged by attributes or variables. The attributes method involves merely
a decision as to whether an item is acceptable or defective; the variables
method involves taking and recording measurements. These two approaches
are familiar in assigning school grades, which may be recorded as "pass"
or "fail" (attributes), or as percentages (variables).

Variables testing preserves more information from the sample, but is
sometimes unnecessary or inapplicable. For instance, to test whether the
length of a fuze falls between 1 1/4 and 1 1/2 inches, we may record the
measured length, or we may simply record "within tolerances" or "not
within tolerances"; but to test whether an aluminum disk has been inserted
during assembly, we can only record "yes" or "no."

The technique of making quality-control charts is to set up, in graphical
form, limits for a sample statistic on the basis of approximately 25
preliminary samples from production, all samples preferably having the same
number of items. If one of the preliminary samples yields a statistic outside
the computed control limits, the production was not initially under control
and should be stabilized before the control chart is set up. The sample
statistic may be the proportion defective, number of defective items, mean,
range, etc. The values of the statistic should cluster about a central line
(central value), which may be a set nominal value in the case of
production to a specified standard, or the mean of previously obtained values in
the case of production which is to be kept stable. The upper and lower
control limits (UCL and LCL, respectively) are usually set up symmetrically
at ±3σ from the central line (where σ is the standard deviation of the
statistic). Then so long as production remains satisfactory, the probability
that a value outside these limits will occur is less than 0.3% (for a normal
distribution). Therefore, if during the course of production a value from
one of the rational subgroups should be recorded outside the limits on its
control chart, it would be concluded that a change in the production process
had occurred and the process would immediately be investigated to
determine the cause of the change. In this way many poor production practices
are caught and corrected before the production of large numbers of
defectives. A typical control chart is shown in Fig. 7.1.
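The figure of less than 0.3% quoted for 3σ limits follows from the normal distribution and can be checked directly. The sketch below uses the complementary error function from the Python standard library; the function name is illustrative.

```python
import math

def prob_outside(k_sigma):
    """Probability that a normal variate falls more than k_sigma
    standard deviations from its mean, in either direction."""
    return math.erfc(k_sigma / math.sqrt(2.0))

p3 = prob_outside(3.0)
print(f"P(outside 3-sigma limits) = {100 * p3:.2f}%")
```

The same function shows how quickly the false-alarm rate grows if narrower limits are chosen, for example `prob_outside(2.0)`.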
FIG. 7.1. Typical Control Chart.

A control chart showing constantly increasing (or decreasing) values
may lead to an investigation before the control limits are actually passed.
Values falling consistently very close to the central line should indicate
excellent production, but may reflect nonrandom sampling. Values
consistently below (or above) the central line may suggest trouble even when
they do not pass the control limits.

The steps in controlling quality by statistical charts are the following:

1. Decide on a statistic (based on attributes or on variables) for each
production operation.

2. Divide each process into rational subgroups.

3. Decide on a sample size n.

4. Construct a control chart showing the central line, UCL, and LCL for
the statistic, based on at least 25 preliminary samples from satisfactory
production, all samples preferably having the same number of items. The
central line may have a preassigned standard value or may be determined
from the preliminary samples. (Sec. 7.1 and 7.2 explain the methods of
computing the limits.)

5. Plot the values from samples taken during the course of production.
Consider as indications of trouble: (a) values falling outside the limits;
(b) values consistently too close to the central line; and (c) values
consistently above (or below) the central line.

6. If a situation listed in the preceding paragraph occurs: (a) locate and
correct any flaw in production; or (b) investigate the possibility of poor
sampling; or (c) recognize that production conditions have changed and
institute a new control chart (if, after the change, the process still yields a
satisfactory product).

Attention  is  called  to  the  references  at  the  end  of  this  chapter.  Also 
applicable  to  the  problems  involved  in  controlling  quality  are  Sec.  4.1,  Run 
Test for Randomness; Sec. 4.9, Tolerance Intervals; and Chapter 8, Acceptance
Sampling.

In Sec. 7.1 and 7.2, the following notation is adopted:¹

k  Number of preliminary samples used to set up chart (at least 25).

n  Sample size.

$\bar{x}$  Sample average, computed as $\bar{x} = \sum_{i=1}^{n} x_i / n$.

S  Sample sum, computed as $S = \sum_{i=1}^{n} x_i$. Note that $\bar{x} = S/n$. (S can be used
instead of $\bar{x}$ in a control chart. This eliminates any need for division
by n.)

R  Sample range, computed as largest reading minus smallest reading in a
sample.

s  Sample standard deviation, computed as

$$s = \sqrt{\frac{n \sum x_i^2 - \left( \sum x_i \right)^2}{n(n - 1)}}$$

p  Proportion defective in sample, computed as

$$p = \frac{\text{number of defective items found}}{\text{number of items inspected}}$$

c  Number of defects in sample. If several defects are found in one item,
they are all counted.

$\bar{s}$  Weighted sample standard deviation obtained from k samples of
slightly differing sizes, computed as

$$\bar{s} = \frac{n_1 s_1 + n_2 s_2 + \cdots + n_k s_k}{n_1 + n_2 + \cdots + n_k}$$

This reduces to the simple average of the $s_j$ if the sample sizes are equal.

Note: A bar over any statistic indicates an average taken over all the
values of that statistic. Thus, $\bar{S} = \sum_{j=1}^{k} S_j / k$, that is, the sum of the
sums $S_j$ for each of the k samples, divided by k. Similarly,
$\bar{\bar{x}} = \sum_{j=1}^{k} \bar{x}_j / k$, the average of the k averages $\bar{x}_j$, and $\bar{p} = \sum_{j=1}^{k} p_j / k$,
the average of the k sample proportions defective.

The constant factors $A$, $A_0$, $A_1$, $A_2$, $B_1$, $B_2$, $B_3$, $B_4$, $c_2$, $D_1$, $D_2$, $D_3$, $D_4$,
and $d_2$ are found in Appendix Table 18.

¹ The formula and notation used here for the sample standard deviation s are
consistent with the rest of the Manual, but inconsistent with much of the control-chart
literature. In using references the reader should check to see how s or σ has been
defined.

7.1.  Charts  Using  Attributes 

This section of the Manual is presented in tabular form for compactness
and ease of comparison. By using Table 7.1 the reader can readily determine
the central line and the control limits for any quality-control chart of the
attributes type.

Example. The average proportion defective $\bar{p}$ for 25 preliminary
samples of 50 igniters each is 0.10. What are the control limits for p
for samples of size 50?

The control limits are computed as

$$\bar{p} \pm 3 \sqrt{\frac{\bar{p}(1 - \bar{p})}{n}} = 0.10 \pm 3 \sqrt{\frac{(0.10)(0.90)}{50}} = 0.10 \pm 0.13$$

= -0.03 and 0.23

Since a negative proportion defective is impossible, we replace the
lower control limit, -0.03, by zero.
These limits, along with the central line and the first 12 subsequent
sample values of p, are shown in Fig. 7.2. Samples 4 through 12 show
constantly increasing p. Since such a long unbroken trend is unlikely to
result from chance alone, the production process may well be checked at
this point before production continues.

Example. In recent satisfactory production, an average of 4.1 flaws
has been found for each sample of 20 propellant grains. For instance,
the first sample of 20 grains contained one flaw in the eleventh grain
(c = 1); the second sample of 20 grains contained two flaws in the third
grain and one flaw in the seventeenth grain (c = 3). Construct a 3σ-limit
control chart.

The control limits are $\bar{c} \pm 3\sqrt{\bar{c}} = 4.1 \pm 3\sqrt{4.1} = 4.1 \pm 6.1$. Since
c is never negative, we use LCL = 0, UCL = 10.2.

From Fig. 7.3 the production process appears to be under control.
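The two attribute-chart examples above can be reproduced with a short sketch. It assumes the usual 3σ limits, with standard deviation $\sqrt{\bar{p}(1-\bar{p})/n}$ for a proportion defective and $\sqrt{\bar{c}}$ for a count of defects; the function names are illustrative, and a negative lower limit is replaced by zero as in the text.

```python
import math

def p_chart_limits(p_bar, n, k_sigma=3.0):
    """Control limits for the proportion defective in samples of size n."""
    half = k_sigma * math.sqrt(p_bar * (1.0 - p_bar) / n)
    return max(0.0, p_bar - half), min(1.0, p_bar + half)

def c_chart_limits(c_bar, k_sigma=3.0):
    """Control limits for the number of defects per sample."""
    half = k_sigma * math.sqrt(c_bar)
    return max(0.0, c_bar - half), c_bar + half

# Igniter example: p-bar = 0.10 from samples of 50
p_lcl, p_ucl = p_chart_limits(0.10, 50)
# Propellant-grain example: c-bar = 4.1 flaws per sample of 20 grains
c_lcl, c_ucl = c_chart_limits(4.1)

print(f"p chart: LCL = {p_lcl:.2f}, UCL = {p_ucl:.2f}")
print(f"c chart: LCL = {c_lcl:.1f}, UCL = {c_ucl:.1f}")
```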


FIG. 7.2. Control Chart for Proportion Defective.

FIG. 7.3. Control Chart for Number of Flaws.

7.2.  Charts  Using  Variables 

Table 7.2 is to be used in conjunction with Appendix Table 18 for
setting up the central line and control limits for any quality-control chart
of the variables type. Allowance for varying sample sizes is made only in
two cases. Reference 7.1 (pages 60-99) gives methods and examples for
these and other cases of varying sample sizes. The average sample size
may sometimes be used when variation is slight (Ref. 7.1, p. 116).

TABLE 7.3. Mean and Range Data in Seconds for
Samples From Flare Production

 Sample   Mean, x̄   Range, R      Sample   Mean, x̄   Range, R

    1      64.97       9.8          14      66.60       0.6
    2      64.60       9.8          15      66.12       6.3
    3      64.12       8.4          16      63.22       7.5
    4      68.52       3.9          17      62.85       6.7
    5      68.35       7.6          18      62.37       4.9
    6      67.87       8.7          19      61.97       6.7
    7      64.97       0.1          20      61.60       9.9
    8      64.60       9.7          21      61.12       6.9
    9      64.12       7.7          22      65.72       0.1
   10      63.22       7.5          23      65.35       8.3
   11      62.85       1.2          24      64.87       5.2
   12      62.37       9.8          25      61.97       3.2
   13      66.97       6.4

Example.  Twenty-five  samples  of  three  are  taken  from  production 
of flares for the purpose of establishing $3\sigma$ control limits for the time of
the  illumination.  The  times  of  the  first  sample  are,  for  example,  60.5, 
70.3,  and  64.1  seconds,  giving  a  mean  of  64.97  seconds  and  a  range  of 
70.3  -  60.5  =  9.8  seconds.  Sample  data  are  given  in  Table  7.3. 

We compute

$$\sum \bar{x} = 1{,}611.29 \qquad \sum R = 156.9$$

$$\bar{\bar{x}} = 64.452 \qquad \bar{R} = 6.28$$

$$n = 3$$

From Appendix Table 18, $A_2 = 1.023$, $D_3 = 0$, and $D_4 = 2.575$.

Control limits for the mean $\bar{x}$ are

$$\bar{\bar{x}} \pm A_2 \bar{R} = 64.452 \pm (1.023)(6.28) = \begin{cases} 70.88 \\ 58.03 \end{cases}$$

Control limits for the range R are

$$D_3 \bar{R} = (0)(6.28) = 0$$

$$D_4 \bar{R} = (2.575)(6.28) = 16.17$$

Figures 7.4 and 7.5 are control charts using these limits.

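
As a check on the arithmetic, the following sketch recomputes the limits directly from the Table 7.3 data. The function name `xbar_r_limits` and the dictionary keys are assumptions made for illustration; the factors `a2`, `d3`, and `d4` are the Appendix Table 18 values quoted above for samples of three.

```python
def xbar_r_limits(means, ranges, a2, d3, d4):
    """Central lines and control limits for mean and range charts."""
    grand_mean = sum(means) / len(means)
    r_bar = sum(ranges) / len(ranges)
    return {
        "grand_mean": grand_mean,
        "r_bar": r_bar,
        "mean_lcl": grand_mean - a2 * r_bar,
        "mean_ucl": grand_mean + a2 * r_bar,
        "range_lcl": d3 * r_bar,
        "range_ucl": d4 * r_bar,
    }


# Sample means and ranges (seconds) for samples 1 through 25 of Table 7.3.
means = [64.97, 64.60, 64.12, 68.52, 68.35, 67.87, 64.97, 64.60, 64.12,
         63.22, 62.85, 62.37, 66.97, 66.60, 66.12, 63.22, 62.85, 62.37,
         61.97, 61.60, 61.12, 65.72, 65.35, 64.87, 61.97]
ranges = [9.8, 9.8, 8.4, 3.9, 7.6, 8.7, 0.1, 9.7, 7.7, 7.5, 1.2, 9.8, 6.4,
          0.6, 6.3, 7.5, 6.7, 4.9, 6.7, 9.9, 6.9, 0.1, 8.3, 5.2, 3.2]

limits = xbar_r_limits(means, ranges, a2=1.023, d3=0.0, d4=2.575)
for name, value in limits.items():
    print(f"{name}: {value:.3f}")
```

Because this sketch carries the unrounded average range through the multiplication, its limits can differ from the hand-computed ones by about 0.01.
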
Chapter 8

ACCEPTANCE  SAMPLING 

8.0.  Introduction 

The  use  of  statistical  charts  by  a  producer  in  controlling  quality  during 
production  was  explained  in  Chapter  7.  This  chapter  will  deal  with 
acceptance  sampling;  that  is,  with  methods  by  which  the  buyer  may 
discriminate  between  acceptable  and  unacceptable  lots  of  product.  The 
methods  also  produce  incidental  information  that  can  be  useful  in  the 
control of quality. (So-called "continuous sampling" does not require
division of the product into lots, but uses samples drawn directly from the
production line. See Ref. 8.4.)

If a buyer finds that it is too expensive, too difficult, or impossible to
screen a production lot (that is, to inspect it 100%), he may decide from
a sample whether to accept or to reject the entire lot. A Navy inspector may
decide on the basis of a sample from each lot of fuzes delivered by a
contractor whether to accept the production lot.

Acceptance sampling may be based on either attributes or variables.
An inspector checking fuzes for delay times, for instance, might judge each
fuze as satisfactory or defective (attributes), or he might record the delay
time for each sample fuze and compute a statistic for deciding whether
to accept the whole lot (variables). (See also Sec. 7.0.)

In the attributes case, the lot is rejected if the sample contains too many
defectives. In the variables case, the criterion may be one-sided or
two-sided, depending upon specifications. If, for example, the specifications set
an upper limit U on the acceptable delay time, the inspector computes the
statistic $(U - \bar{x})/s + 10$ from the delay times of the sample and rejects
the lot if this quantity is less than an acceptance constant $A_U$; otherwise he
accepts the lot. If, instead, the specifications set a lower limit L, the
inspector rejects the lot if the statistic $(L - \bar{x})/s + 10$ exceeds an
acceptance constant $A_L$; otherwise he accepts the lot. In case the
specifications set both upper and lower limits on the delay time, the inspector
should use a two-sided criterion rather than two one-sided criteria.
References 8.2 and 8.3 provide slightly different plans for this case, that of
Ref. 8.2 being the more accurate and that of Ref. 8.3 being the more easily
applied.

Variables inspection may provide for bounds on $\bar{x}$ alone or on
$\sigma$ alone, instead of on a combination. For these techniques, see
Ref. 8.1, pages 79-116.

The  plans  commonly  used  by  naval  ordnance  inspectors  for  acceptance 
sampling  by  attributes  are  given  in  Ref.  8.5;  those  for  acceptance  sampling 
by  variables,  in  Ref.  8.3.  A  collection  of  variables  plans  is  given  also  in 
Ref.  8.2.  This  chapter  will  acquaint  the  reader  with  the  plans  given  in  Ref. 
8.2,  8.3,  and  8.5,  and  explain  briefly  the  construction  of  such  plans. 

8.1.  Attributes  or  Variables 

The  decision  whether  to  use  attributes  or  to  use  variables  may  be  made 
on  the  basis  of  over-all  cost,  keeping  in  mind  the  following  points: 

1.  A  larger  sample  is  required  for  attributes  sampling  than  for  variables 
sampling  to  obtain  equivalent  discrimination  between  good  and  bad  lots. 
Therefore,  variables  inspection  is  preferred  if  sample  items  are  costly  and 
inspection  is  destructive.  This  consideration  is  important,  for  instance,  in 
the  case  of  a  weapon  that  is  expensive  or  in  short  supply. 

2.  The  actual  measurements  and  computations  required  for  variables 
inspection  may  be  more  costly  than  the  yes-or-no  decision  and  tallying 
required  for  attributes  testing.  This  must  be  taken  into  account  whenever 
the  testing  itself  is  difficult,  expensive,  or  time-consuming. 

3.  Variables  methods  produce  as  a  by-product  information  that  can  be 
valuable  in  diagnosing  production  ills. 

4.  Variables  plans  depend  for  exactness  on  an  assumption  of  normality 
in  the  distribution  of  the  variable  measured,  though  the  plans  may  be  used 
as  approximate  methods  when  the  distribution  departs  from  normality. 
Attributes  plans  are  not  subject  to  such  a  restriction.  (See  Sec.  4.3  for  tests 
of  normality.) 

5.  Attributes  sampling  is  more  widely  known  than  variables  sampling 
and  therefore  may  require  less  training  of  inspectors. 
8.2. Choice of Plan

There  are  many  possible  bases  for  choosing  one  sampling  plan  rather 
than  another.  In  some  cases  a  physical  situation  or  the  limitations  of  a 
budget  may  dictate  the  sample  size  or  other  aspects  of  the  plan.  The  methods 
given  here,  which  follow  those  of  Ref.  8.2,  8.3,  and  8.5,  base  the  choice 
of  a  plan  on  setting  the  risks  of  rejecting  a  good  lot  or  of  accepting  a 
bad  one. 

Having  decided  what  is  to  constitute  an  inspection  lot  of  the  product 
(this may be a production lot, part of a production lot, or several production
lots taken together), what is to constitute a unit or item in the lot, and
whether  to  test  by  attributes  or  by  variables,  we  must  decide  on  (1)  an 
acceptable-quality  level,  (2)  the  number  of  groups  of  items  to  be  sampled, 
and  (3)  the  inspection  level.  These  factors  will  be  dealt  with  in  the  sections 
that  follow. 

8.2.1. Acceptable-Quality Level (AQL)

Each  sampling  plan  has  the  property  that  the  more  defectives  a  lot 
contains  the  more  likely  it  is  to  be  rejected.  Since  the  inspection  is  not 
100%,  however,  lots  with  a  few  defectives  will  often  be  accepted.  The 
AQL  is  the  percentage  of  defective  items  in  a  lot  that  we  are  willing  to 
tolerate most of the time. The plans in Ref. 8.3 are arranged so that, on
the average, lots with AQL% defective will be accepted from 90% of the
time for small lots to 99% for large lots; the plans in Ref. 8.5 will accept
lots having AQL% defective from 80% of the time for small lots to 99.8%
for large lots; and the plans in Ref. 8.2 will accept lots having AQL%
defective 95% of the time.

In Ref. 8.5, a range of AQL values is approximated by one value in that
range. For instance, to apply the plans for any specified AQL from 0.040
to 0.069%, we use the tabled value for AQL 0.065%. (See Table I in
Ref. 8.5 for these representative AQL's.) In Ref. 8.5, AQL's over 10.0%
refer only to the number of defects per hundred items (allowing several
defects per item), rather than to the percentage of defective items in the
lot. Reference 8.3 gives plans for six AQL's: 0.10, 0.25, 1.0, 2.5, 4.0,
and 6.5%. Reference 8.2 gives plans for 15 ranges of AQL.
8.2.2. Single, Double, or Multiple Sampling

A  buyer  may  make  his  decision  to  accept  or  reject  a  lot  on  the  basis  of 
one  sample;  that  is,  he  may  use  a  single-sampling  plan.  Or  he  may  use 
a  double-sampling  plan,  under  which  he  draws  a  first  sample  and  then 
makes  one  of  the  following  decisions.  If  the  first  sample  is  sufficiently 
good,  he  will  accept  the  lot  without  testing  further;  if  it  is  sufficiently  poor, 
he  will  reject  the  lot;  if  it  is  intermediate,  he  will  inspect  a  second  sample 
before  deciding.  A  multiple-sampling  plan  extends  the  same  idea  to 
several  samples  drawn  in  sequence  from  the  same  lot,  with  the  possibility 
of  a  decision  on  the  lot  (without  further  sampling)  at  each  stage. 

An  attributes  plan  specifies  an  acceptance  number  Ac  and  a  rejection 
number  Re  for  each  step  of  sampling.  If  the  sample  has  Ac  or  fewer 
defective  items,  the  buyer  accepts  the  lot  without  further  sampling.  If  the 
sample  has  Re  or  more  defective  items,  he  rejects  the  lot  without  further 
sampling.  If  the  number  of  defectives  is  between  Ac  and  Re,  he  draws 
another  sample  before  deciding.  After  each  successive  sampling,  he  makes 
a  decision  on  the  basis  of  the  total  sample  accumulated,  using  the  acceptance 
and  rejection  numbers  for  that  stage.  For  the  last  possible  sample  in  each 
plan  (the  only  sample  for  a  single-sampling  plan,  the  second  sample  for  a 
double-sampling  plan,  and  the  final  sample  for  a  multiple-sampling  plan) 
we have Re = Ac + 1, so that the lot will either be accepted (Ac or fewer
defective  sample  items)  or  rejected  (Re  or  more  defectives)  with  no  further 
sampling. 
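
The stage-by-stage rule just described can be sketched as a short routine. The function name `attributes_decision` and the acceptance and rejection numbers used in the illustration are hypothetical; in practice the pairs (Ac, Re) come from the tables of the chosen plan.

```python
def attributes_decision(defectives_per_sample, stages):
    """Apply a single-, double-, or multiple-sampling attributes plan.

    defectives_per_sample: defectives found in each sample drawn so far.
    stages: list of (Ac, Re) pairs, one per stage, applied to the
            cumulative number of defectives.
    Returns "accept", "reject", or "continue" (draw another sample).
    """
    total = 0
    for found, (ac, rej) in zip(defectives_per_sample, stages):
        total += found
        if total <= ac:
            return "accept"
        if total >= rej:
            return "reject"
    return "continue"


# Hypothetical double-sampling plan: Ac = 2, Re = 5 on the first sample,
# then Ac = 4, Re = 5 on the combined sample (Re = Ac + 1 at the last stage).
plan = [(2, 5), (4, 5)]
print(attributes_decision([1], plan))     # few defectives in first sample
print(attributes_decision([3], plan))     # intermediate first sample
print(attributes_decision([3, 2], plan))  # decision forced at second stage
```

Because Re = Ac + 1 at the final stage, the routine can return "continue" only when fewer samples have been supplied than the plan has stages.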

For variables inspection, only single- and double-sampling plans are
available; see Ref. 8.2 and 8.3. To carry out, for example, a
double-sampling plan, the buyer makes use of constants $k_a$, $k_r$, and $k_t$
given in Ref. 8.2. He computes for the first sample the mean

$$\bar{x}_1 = \frac{\sum x_1}{n_1}$$

and standard deviation

$$s_1 = \sqrt{\frac{n_1 \sum x_1^2 - (\sum x_1)^2}{n_1 (n_1 - 1)}}$$

If $\bar{x}_1 + k_a s_1 \le U$ (upper limit), he accepts the lot. If
$\bar{x}_1 + k_r s_1 > U$, he rejects the lot. If neither decision can be
made, he draws a second sample. Computing

$$\bar{x}_t = \frac{\sum x_1 + \sum x_2}{n_1 + n_2}$$

and

$$s_t = \sqrt{\frac{(n_1 + n_2)(\sum x_1^2 + \sum x_2^2) - (\sum x_1 + \sum x_2)^2}{(n_1 + n_2)(n_1 + n_2 - 1)}}$$

for the total sample, he accepts the lot if $\bar{x}_t + k_t s_t \le U$ and
rejects it if $\bar{x}_t + k_t s_t > U$.
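
A minimal sketch of this double-sampling procedure for an upper limit follows. The function names and the numerical values of the constants and measurements are assumptions chosen for illustration only; the real constants must be taken from Ref. 8.2.

```python
import math


def mean_and_sd(values):
    """Sample mean and standard deviation by the computing formula."""
    n = len(values)
    sum_x = sum(values)
    sum_x2 = sum(v * v for v in values)
    mean = sum_x / n
    sd = math.sqrt((n * sum_x2 - sum_x ** 2) / (n * (n - 1)))
    return mean, sd


def double_sampling_upper(first, second, upper, ka, kr, kt):
    """Decide on a lot with a double-sampling variables plan.

    first, second: measurements of the first and second samples
                   (second may be None if not yet drawn).
    upper: upper specification limit U.
    ka, kr, kt: plan constants (hypothetical values in the example below).
    """
    m1, s1 = mean_and_sd(first)
    if m1 + ka * s1 <= upper:
        return "accept"
    if m1 + kr * s1 > upper:
        return "reject"
    if second is None:
        return "draw second sample"
    mt, st = mean_and_sd(list(first) + list(second))
    return "accept" if mt + kt * st <= upper else "reject"


# Hypothetical data and constants, for illustration only.
first = [10, 11, 9, 10, 10]
second = [10, 10, 11, 9, 10]
print(double_sampling_upper(first, None, upper=11.0, ka=2.5, kr=1.0, kt=1.8))
print(double_sampling_upper(first, second, upper=11.0, ka=2.5, kr=1.0, kt=1.8))
```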

In  contrast  to  the  single-sampling  plan,  a  plan  that  allows  several  samples 
does  not  specify  in  advance  how  many  items  will  be  required,  since  the 
number  depends  on  whether  one  of  the  early  samples  yields  a  decision. 
However,  the  maximum  number  of  items  that  could  possibly  be  needed  for 
the  total  sample  is  specified  by  the  plan.  The  average  sample  number  for  a 
given  lot  quality  can  be  computed  (with  some  difficulty)  by  formulas  given 
in  Ref.  8.9,  pp.  203-11,  for  double  or  multiple  sampling  by  attributes.  For 
double  sampling  by  variables,  the  average  sample  number  can  be  computed 
by  formulas  given  in  Ref.  8.2,  p.  128.  This  average  sample  number  is 
usually  smaller  than  the  fixed  sample  size  of  a  single-sampling  plan,  because 
a  multiple  plan  gives  the  buyer  a  chance  to  make  his  decision  on  the  basis 
of one of the relatively small early samples if it happens to be poor enough
or  good  enough. 


8.2.3.  Inspection  Level 

After a decision has been made to use a single-, double-, or
multiple-sampling plan, an inspection level must be chosen. Three levels of
inspection, designated I, II, and III, correspond to the different levels of the
importance of detecting defective items at the expense of large sample
sizes. Inspection level III is appropriate if defectives must be rejected
whenever possible, regardless of the size of the sample, as might be the
case when safety from blowups is a consideration. If, on the other hand,
the cost of testing is unusually high and the acceptance of some defectives
is not a serious matter, inspection level I is appropriate. Level II, a
compromise, should be used unless there is a special need for one of the
other levels.
The choice of an inspection level and of the entire sampling plan should
be  made  only  after  careful  consideration  of  the  risks  of  error  involved,  the 
cost  of  inspection,  and  the  costs  associated  with  incorrect  decisions.  The 
risks  of  error  of  a  particular  plan  may  be  obtained  from  an  operating- 
characteristic  (OC)  curve,  as  described  in  Sec.  8.3  below.  Special  problems, 
for  instance  those  involving  safety,  may  require  construction  of  special  plans. 

8.2.4.  Sample  Size  and  Severity  of  Inspection 

Table  III  of  Ref.  8.5,  Table  III  of  Ref.  8.3,  or  Table  A  of  Ref.  8.2  will 
yield  the  sample-size  code  letter  for  a  given  lot  size.  This  code  letter  is 
required  when  entering  later  tables  for  the  characteristics  of  the  desired 
sampling  plan,  in  particular  the  sample  size. 

The  severity  of  inspection  influences  the  code  letter  and  hence  the 
characteristics  of  the  plan  itself.  Unless  previous  inspection  has  indicated 
otherwise,  normal  inspection  is  used.  References  8.2,  8.3,  and  8.5  provide 
also  for  reduced  inspection  and  tightened  inspection  at  each  level,  and 
the  criteria  for  their  use.  The  standard  of  inspection  may  be  reduced 
when  the  quality  of  production  has  been  uniformly  good  over  a  period  of 
time,  or  tightened  when  the  quality  has  been  poor. 

Example. For an attributes plan for a lot of size 100, we find from
Table III of Ref. 8.5 the code letter F for normal inspection at level II.
For a variables plan, we find from Table III of Ref. 8.3 the code letter B
for normal inspection at level II.

If  the  sample  size  n  is  a  constant  proportion  of  the  lot  size  N,  the  cost 
of  inspection  per  unit  produced  remains  constant;  however,  the  risks  of 
wrong  decisions  are  not  constant,  but  increase  with  decreasing  lot  size.  In 
the  plans  of  Ref.  8.2,  8.3,  and  8.5,  therefore,  the  sample  sizes  are  not 
decreased  in  proportion  to  the  lot  sizes,  but  are  given  compromise  values 
to  keep  the  risks  of  rejecting  good  lots  or  of  accepting  bad  lots  from  rising 
inordinately as the lot sizes are decreased. In acceptance sampling
demanding particular attention to the risks, as in the case of safety
considerations, it may be preferable to choose the sample size n (in single
sampling) according to the approximation

$$n = \frac{N n_\infty}{N + n_\infty}$$

where N is the lot size and $n_\infty$ is the sample size that would give the
specified risks for an infinitely large lot size.
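
The approximation is simple enough to evaluate by hand, but a short sketch shows how the required sample size shrinks as the lot gets smaller. The function name and the value of 100 used for the infinite-lot sample size are assumptions for illustration.

```python
import math


def finite_lot_sample_size(n_inf, lot_size):
    """Approximate sample size for a finite lot.

    n_inf: sample size giving the specified risks for an infinitely
           large lot (hypothetical value in the example below).
    lot_size: number of items N in the lot.
    """
    return lot_size * n_inf / (lot_size + n_inf)


# Illustration: the infinite-lot sample size is assumed to be 100.
for lot in (50, 100, 1000, 100000):
    n = finite_lot_sample_size(100, lot)
    print(f"N = {lot:6d}: n is about {math.ceil(n)}")
```

Rounding up with `math.ceil` is a conservative choice made here, not something the text prescribes.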


8.2.5.  Plans  for  Normal  Inspection 

The  normal  inspection  plan  corresponding  to  the  specified  AQL  and 
sample-size  code  letter  can  now  be  found  as  follows. 

a.  Attributes.  For  the  acceptance  and  rejection  numbers  of  an  attributes- 
sampling  plan,  turn  to  Ref.  8.5,  and  enter  Table  IV-A  for  a  single-sampling 
plan;  Table  IV-B  for  a  double-sampling  plan;  and  Table  IV-C  for  a 
multiple-sampling  plan. 

Example.  A  buyer  needs  an  attributes  double-sampling  plan  to  test 
lots  of  7,000  items  each.  He  is  willing  to  accept,  most  of  the  time,  lots 
that  contain  0.30%  defective  items.  What  is  the  plan  recommended  by 
Ref.  8.5  for  such  a  case? 

From Table I of Ref. 8.5, for a specified AQL falling between 0.280
and 0.439% the representative AQL value is 0.40%. From Table III
of Ref. 8.5, for a lot size of 3,201 to 8,000 the sample-size code letter
is M for inspection level II. For convenience, part of Table IV-B
(double-sampling plan) from Ref. 8.5 is reproduced as Table 8.1, and
the appropriate figures have been italicized.


The buyer is instructed to draw a random sample of 150 items from
the lot to be tested. If among the 150 he finds 0, 1, or 2 defective items,
he should accept the lot. If he finds 5, 6, 7, or more defective items, he
should reject the lot. If he finds 3 or 4 defective items in the first sample,
he must draw a second sample in order to reach a decision. The second
sample should contain 300 items, making a total of 450. If the number
of defectives from the first sample plus the number from the second is
4 or fewer, the buyer should accept the lot. If the total number of
defectives is 5 or more, he should reject the lot. Notice that he will
either accept or reject at this stage, if he has not done so before, so that
no further sampling is required.


b. Variables. To get the acceptance constant $A_U$ or $A_L$ of a
variables-sampling plan, enter Table IV in Ref. 8.3 (which deals with
single-sampling plans only) with the desired AQL value and the sample-size
code letter.

In operation, the single-sampling plan calls for calculating from the
sample

$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$$

and

$$s = \sqrt{\frac{n \sum x_i^2 - (\sum x_i)^2}{n (n - 1)}}$$

If U is the specified upper limit of the measurement x for a nondefective
item, compute the lot-quality index

$$C_U = \frac{U - \bar{x}}{s} + 10$$

and accept the lot if $C_U \ge A_U$ or reject the lot if $C_U < A_U$. For a
lower specification limit L, compute the lot-quality index

$$C_L = \frac{L - \bar{x}}{s} + 10$$

and accept the lot if $C_L \le A_L$ or reject the lot if $C_L > A_L$.

Example. Suppose that the Navy buys explosive charges from a
contractor. Since a charge having a heat of explosion less than 400
calories per gram is considered acceptable, the buyer is willing to accept,
most of the time, lots in which not more than 0.25% of the charges
have heats of explosion greater than 400 calories per gram. The charges
are easy to test, and it is imperative that faulty charges be discovered.
There are 2,000 charges in each lot. What is the single-sampling plan
recommended by Ref. 8.3?

From  the  statement  of  the  problem,  the  upper  specification  limit  U 
is  400  calories  per  gram,  the  AQL  is  0.25%,  and  the  inspection  level 
is  III. 
From Table III of Ref. 8.3, the sample-size code letter is F for normal
inspection at level III for a lot of size 1,301 to 8,000. Part of Table IV
of Ref. 8.3 is reproduced as Table 8.2. From this table we find that for
AQL 0.25% the acceptance constant $A_U$ is 12.45.


TABLE 8.2. Excerpt From Table IV of Ref. 8.3

                              AQL for normal or reduced inspection
Sample-size     Sample        0.10%        0.25%        1.0%
code letter     size          A_U          A_U          A_U

F               130           12.70        12.45        12.01

To dispose of each lot of 2,000 charges, the buyer draws a random
sample of 130. He measures the heat of explosion for each one, and
calculates

$$\bar{x} = \frac{1}{130} \sum_{i=1}^{130} x_i$$

and

$$s = \sqrt{\frac{130 \sum x_i^2 - (\sum x_i)^2}{(130)(129)}}$$

He computes

$$C_U = \frac{400 - \bar{x}}{s} + 10$$

and compares it with the acceptance constant $A_U = 12.45$, accepting the
lot if $C_U \ge 12.45$, and rejecting the lot if $C_U < 12.45$.
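
The single-sampling computation can be sketched as follows, using the standard-library `statistics` module for the mean and the (n - 1)-divisor standard deviation. The function names and the five heat-of-explosion readings are hypothetical; an actual application of this plan would use all 130 measurements.

```python
import statistics


def lot_quality_index_upper(values, upper):
    """Lot-quality index C_U = (U - mean)/s + 10 for an upper limit U."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation, divisor n - 1
    return (upper - mean) / sd + 10


def accept_lot_upper(values, upper, a_u):
    """Accept when the index is at least the acceptance constant A_U."""
    return lot_quality_index_upper(values, upper) >= a_u


# Hypothetical readings in calories per gram (far fewer than the 130
# items the plan calls for), with U = 400 and A_U = 12.45 from the example.
readings = [390, 392, 388, 391, 389]
index = lot_quality_index_upper(readings, upper=400)
print(f"C_U = {index:.2f}, accept = {accept_lot_upper(readings, 400, 12.45)}")
```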


8.3. Operating-Characteristic (OC) Curve and Its
Use in Designing Plans

Since  the  plans  described  in  this  chapter  involve  sampling,  there  is  a 
risk  of  error.  References  8.2,  8.3,  and  8.5  give  operating-characteristic  (OC) 
curves  for  the  plans  they  include.  These  OC  curves  should  be  considered 
carefully,  because  the  method  for  determining  the  plan  to  use  does  not 
fully  describe  the  plan  itself,  especially  with  respect  to  the  buyer’s  risk  of 
accepting  lots  he  considers  definitely  unsatisfactory.  The  OC  curve  shows 
for each possible percentage of defective items in a lot the probability of
accepting  a  lot  of  that  quality.  The  ideal  plan  would  have  a  steep  curve, 
which  would  discriminate  sharply  between  good  lots  and  bad  lots.  (This 
is  a  special  use  for  the  OC  curve  described  in  Sec.  1.1.9.  Examples  of  such 
curves  appear  in  Appendix  Charts  V,  VI,  VII,  and  VIII.) 

One important quantity available from the OC curve is the producer's
risk $\alpha$. For the lot proportion defective $p_1$ specified by the AQL, the
curve gives the probability of accepting a lot of that quality. This probability
is $1 - \alpha$. (See Fig. 8.1.) For the plans of Ref. 8.2 the producer's risk is
5%, so that each curve passes through the point (AQL, 0.95). As mentioned
in Sec. 8.2.1, however, this is not true for the plans of Ref. 8.3 and 8.5.

It is also important to check the OC curve for high proportions of
defective items; for instance, the point $(p_2, \beta)$, where $p_2$ is a definitely
unsatisfactory proportion defective and $\beta$ is the consumer's risk of
accepting a lot of that quality. A commonly used pair of coordinates is
(LTPD, 0.10), where LTPD stands for lot tolerance percent defective
and 10% is a conventional choice for the consumer's risk.

Peach (Ref. 8.8) gives a method for designing a single- or a
double-sampling plan by attributes with an OC curve that passes near the two
given points $(p_1, 0.95)$ and $(p_2, 0.05)$. Following this method, with
$p_1$ = AQL, form the operating ratio

$$R_0 = \frac{p_2}{p_1}$$

For a single-sampling plan, enter Appendix Table 19(a), using $R_0$ or the
next larger tabled value to obtain the acceptance number Ac and $np_1$.
Divide $np_1$ by $p_1$ to get the sample size n. (Re = Ac + 1.)

To determine the actual abscissas $p_1^*$ and $p_2^*$ for the ordinates 0.95 and
0.05 of the OC curve of this plan, let c = Ac, and compute

$$p_1^* = \frac{c + 1}{c + 1 + (n - c) F_1}$$

$$p_2^* = \frac{(c + 1) F_2}{n - c + (c + 1) F_2}$$

where $F_1$ is the 5% point of the F distribution with $f_1 = 2(n - c)$ and
$f_2 = 2(c + 1)$ degrees of freedom, and $F_2$ is the 5% point of the F
distribution with $f_1 = 2(c + 1)$ and $f_2 = 2(n - c)$ degrees of freedom. (See
Appendix Table 5.) If these actual values do not come close enough to the
desired $p_1$ and $p_2$, they can be adjusted by changes in Ac and n. Since $np_1$
remains nearly constant, an increase in n produces a decrease in $p_1^*$.

FIG. 8.1. Typical Operating-Characteristic Curve.

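
When F tables are not at hand, the same two abscissas can be found numerically. The sketch below is an alternative to the F-distribution formulas, not the Manual's method: it solves the binomial acceptance-probability equation directly by bisection, which should give the same values because both approaches solve the same binomial tail equations. It assumes the lot is large enough for the binomial distribution to apply; the function names and the plan n = 20, c = 2 are hypothetical.

```python
from math import comb


def acceptance_probability(c, n, p):
    """P(number of defectives <= c) in a sample of n, binomial model."""
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(c + 1))


def proportion_for_ordinate(c, n, target, tol=1e-10):
    """Find p such that the acceptance probability equals target.

    The acceptance probability decreases as p increases, so plain
    bisection on the interval (0, 1) is sufficient.
    """
    low, high = 0.0, 1.0
    while high - low > tol:
        mid = (low + high) / 2
        if acceptance_probability(c, n, mid) > target:
            low = mid   # still accepted too often: move to a larger p
        else:
            high = mid
    return (low + high) / 2


# Hypothetical single-sampling plan: n = 20, Ac = c = 2.
p1_star = proportion_for_ordinate(2, 20, 0.95)
p2_star = proportion_for_ordinate(2, 20, 0.05)
print(f"p1* = {p1_star:.4f}, p2* = {p2_star:.4f}")
```
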
For a double-sampling plan, enter Appendix Table 19(b), using $R_0$ or the
next larger tabled value to obtain $Ac_1$, $Re_1$, $Ac_2$, and $n_1 p_1$. Divide
$n_1 p_1$ by $p_1$ to get $n_1$, the size of the first sample. If the number of
defectives in the first sample is greater than $Ac_1$ but less than $Re_1$, take a
second sample of size $n_2 = n_1$. For the total sample, the acceptance number
is $Ac_2$ and the rejection number is $Re_2 = Ac_2 + 1$. (The double-sampling
plans of Ref. 8.5 and 8.9 differ from this plan in having $n_2 = 2n_1$ and
$Re_2 = Re_1$.)

Example. Design a double-sampling plan that will accept lots
containing 10% defective items 95% of the time, and lots containing 35%
defective items only 5% of the time.

The OC curve of the desired test is to pass through the points
(0.10, 0.95) and (0.35, 0.05); that is, $p_1 = 0.10$ and $p_2 = 0.35$. We
compute the operating ratio

$$R_0 = \frac{p_2}{p_1} = \frac{0.35}{0.10} = 3.5$$

From Appendix Table 19(b), for $R_0 = 3.5$, we read

$$Ac_1 = 2 \qquad Re_1 = 6 \qquad Ac_2 = 7 \qquad n_1 p_1 = 1.99$$

Therefore

$$n_1 = \frac{n_1 p_1}{p_1} = \frac{1.99}{0.10} = 19.9 \approx 20$$

and

$$n_2 = n_1 = 20$$

Accordingly, the first random sample is made up of 20 items. If it
contains 0, 1, or 2 defectives, the lot is accepted. If it contains 6 or more
defectives, the lot is rejected. If it contains 3, 4, or 5 defectives, 20 more
items are drawn at random from the lot. If the combined sample of 40
contains 7 or fewer defectives, the lot is accepted; otherwise, it is rejected.
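
How closely the plan just designed meets its two target points can be checked with a short calculation. The sketch below assumes a lot large enough for the binomial distribution to apply; the function names are introduced here for illustration. Since the method only promises a curve passing near the two points, exact agreement with 0.95 and 0.05 should not be expected.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k defectives in a sample of n."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)


def double_plan_acceptance(p, n1, n2, ac1, re1, ac2):
    """Probability that a double-sampling attributes plan accepts the lot."""
    # Accepted outright on the first sample.
    prob = sum(binomial_pmf(d, n1, p) for d in range(ac1 + 1))
    # Intermediate first samples lead to a second sample; the lot is then
    # accepted if the combined count of defectives does not exceed ac2.
    for d1 in range(ac1 + 1, re1):
        second_ok = sum(binomial_pmf(d2, n2, p) for d2 in range(ac2 - d1 + 1))
        prob += binomial_pmf(d1, n1, p) * second_ok
    return prob


# Plan from the example: n1 = n2 = 20, Ac1 = 2, Re1 = 6, Ac2 = 7.
for p in (0.10, 0.35):
    pa = double_plan_acceptance(p, 20, 20, 2, 6, 7)
    print(f"p = {p:.2f}: probability of acceptance = {pa:.3f}")
```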


8.4.  Sequential  Plans 

In double- and multiple-sampling plans, we take advantage of the fact
that a very poor lot or a very good lot can be expected to reveal its character
in a small first sample; usually, more extensive sampling is necessary only
for lots of medium quality. This idea has been exploited still further in
sequential analysis, which calls for drawing one sample item at a time
with the possibility of a decision on the lot after each drawing. (Actually,
the method can also be used if items are in groups, as in multiple sampling.
In this case, the average increase in sample size over pure sequential
sampling is no greater than the group size.)

To  perform  a  sequential  test  for,  say,  acceptance  sampling  by  attributes, 
the  inspector  needs  a  chart  similar  to  Fig.  8.2,  on  which  there  are  two 
slanting  parallel  lines.  The  abscissa  represents  the  number  of  items  drawn 
and  the  ordinate  represents  the  number  of  defective  items  drawn.  As  the 
inspector  draws  and  tests  each  item,  he  plots  a  point  on  the  chart  one  unit 
to  the  right  if  the  item  is  not  defective;  one  unit  to  the  right  and  one  unit 
up  if  the  item  is  defective.  If  a  continuing  path  connecting  all  the  points 
crosses  the  upper  parallel  line,  the  inspector  will  reject  the  lot.  If  it  crosses 
the  lower  parallel  line,  he  will  accept  the  lot.  If  it  remains  between  the  two 
lines,  he  will  draw  another  sample  item. 

This  technique  is  economical  in  the  use  of  sample  items,  is  easy  to  apply, 
and  is  adaptable  to  the  following: 

1.  Acceptance  sampling  using  attributes  (Sec.  8.4.1) 

2.  Making  a  decision,  based  on  attributes,  between  two  methods  or 
products 

3.  Making  one-sided  and  two-sided  significance  tests  for  the  mean  of  a 
normal  distribution 

4.  Making  a  one-sided  significance  test  for  the  standard  deviation  of  a 
normal  distribution  (Sec.  8.4.2) 

Sequential  analysis  is  not  appropriate  when  it  is  inconvenient  to  wait  for 
test  results  on  one  item  at  a  time,  or  when  testing  is  inexpensive  compared 
with  producing  items  for  test,  so  that  it  is  preferable  to  test  all  provided 
items  in  a  sample  of  fixed  size.  References  8.7,  8.10,  and  8.12  give  clear 
expositions  of  the  situations  listed  above,  in  addition  to  the  following 
treatments  of  situations  1  and  4. 

8.4.1.  Sequential  Plan  for  Proportion  Defective  p 

To construct a chart for use in a sequential-sampling plan for proportion
defective p with OC curve through the points $(p_1, 1 - \alpha)$ and
$(p_2, \beta)$, we draw on graph paper a horizontal axis n to represent the
number of items drawn, and a vertical axis d to represent the number of
defective items drawn. On these axes we draw the parallel lines

$$d_1 = -h_1 + sn \quad \text{(lower)}$$

$$d_2 = h_2 + sn \quad \text{(upper)}$$

To find the intercepts $-h_1$ and $h_2$ and the common slope s, we first compute
the auxiliary quantities

$$g_1 = \log \frac{p_2}{p_1} \qquad g_2 = \log \frac{1 - p_1}{1 - p_2}$$

$$a = \log \frac{1 - \beta}{\alpha} \qquad b = \log \frac{1 - \alpha}{\beta}$$

(Taking all logarithms to the base 10, we find the values of a and b in
Appendix Table 20 for common values of $\alpha$ and $\beta$.) Then we have

$$h_1 = \frac{b}{g_1 + g_2} \qquad h_2 = \frac{a}{g_1 + g_2} \qquad s = \frac{g_2}{g_1 + g_2}$$


Once  the  chart  has  been  drawn,  its  use  is  simple.  For  each  item  inspected, 
the point (n, d) is plotted. This point shows how many items have been
inspected  and  how  many  of  them  were  found  defective.  Notice  that  n  will 
increase  by  one  each  time,  moving  the  point  one  unit  to  the  right.  If  the 
item  is  defective,  the  point  will  also  move  up  one  unit.  As  the  sampling 
continues,  the  series  of  points  plotted  may  be  connected  by  a  continuing 
path.  If  at  any  point  this  path  crosses  or  meets  the  lower  of  the  parallel 
lines,  we  accept  the  lot.  If  it  crosses  or  meets  the  upper  line,  we  reject  the 
lot.  As  long  as  it  remains  between  the  lines,  we  continue  sampling. 

A truncated sequential-sampling plan may be used to prevent the
possibility of requiring a very large sample, a possibility that is present in
the ordinary sequential plan, especially for borderline lots. If we agree to
stop sampling when

$$n = \frac{3ab}{g_1 g_2}$$

the change in the risks $\alpha$ and $\beta$ of the ordinary plan will be negligible,
because it is rarely necessary to test this number of samples before a decision
is reached. If n gets this large with no decision, we accept the lot if the
vertical distance from the last point to the lower line is less than the vertical
distance from the point to the upper line. Otherwise we reject the lot.

Example.  Design  a  truncated  sequential-sampling  plan  that  will  reject 
only  5%  of  the  time  lots  containing  0.1%  defective  items,  but  will  reject 
90%  of  the  time  lots  containing  5%  defective  items. 

We compute g_1 = 1.69897 and g_2 = 0.02185. From Appendix Table
20 we have a = 1.255 and b = 0.978. Then

\[ h_1 = \frac{b}{g_1 + g_2} = \frac{0.978}{1.69897 + 0.02185} = \frac{0.978}{1.721} = 0.568 \]

\[ h_2 = \frac{a}{g_1 + g_2} = \frac{1.255}{1.721} = 0.729 \]

\[ s = \frac{g_2}{g_1 + g_2} = \frac{0.02185}{1.721} = 0.01270 \]

To truncate the plan, we agree to stop sampling after

\[ n = \frac{3ab}{g_1 g_2} = \frac{3(1.255)(0.978)}{(1.69897)(0.02185)} = 99.2 \approx 99 \]

items have been inspected, accepting if the continuing path is closer to
the acceptance (lower) line and rejecting if the path is closer to the
rejection (upper) line.
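
The arithmetic of this example can be checked with a short Python sketch. It assumes that g_1 and g_2 are the common logarithms of p2/p1 and (1 - p1)/(1 - p2), which is consistent with the figures quoted above; the function name and the dictionary keys are illustrative only. Because the logarithms are evaluated to full precision, the last digit of a result may differ slightly from the hand computation.

```python
import math

def attributes_plan(p1, p2, alpha, beta):
    """Constants of a sequential plan for proportion defective (a sketch)."""
    g1 = math.log10(p2 / p1)              # assumed definition of g_1
    g2 = math.log10((1 - p1) / (1 - p2))  # assumed definition of g_2
    a = math.log10((1 - beta) / alpha)    # as tabled in Appendix Table 20
    b = math.log10((1 - alpha) / beta)
    return {
        "h1": b / (g1 + g2),              # intercept of the acceptance line
        "h2": a / (g1 + g2),              # intercept of the rejection line
        "s": g2 / (g1 + g2),              # common slope
        "n_stop": 3 * a * b / (g1 * g2),  # truncation point
    }

plan = attributes_plan(p1=0.001, p2=0.05, alpha=0.05, beta=0.10)
for name, value in plan.items():
    print(f"{name} = {value:.4f}")
```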


Figure  8.2  shows  the  sampling  chart  used  for  a  hypothetical  sample  of 
99.  The  sampling  technique  led  to  acceptance  of  the  lot. 


FIG. 8.2. Truncated Sequential-Sampling Chart for an Attributes Inspection Plan.
The operating-characteristic curve passes through (0.001, 0.95) and (0.05, 0.10).
(Horizontal axis: number of items sampled, n.)

8.4.2. Sequential Plan for Standard Deviation σ

If the measurements of a lot or process follow a normal distribution with
unknown mean μ and standard deviation σ, we can construct a sequential
method for testing the null hypothesis that σ has a hypothetical value σ_1
against the alternative σ = σ_2 > σ_1. (See Sec. 3.2.2 for the non-sequential
one-sided test.) Suppose there is an assigned risk α of rejecting the hypothesis
when σ does equal σ_1, and an assigned risk β of accepting the hypothesis
when actually σ has the higher value σ_2.

Let  n  stand  for  the  number  of  sample  items  inspected,  and  let  Z  stand 
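for the sum of squared deviations from the sample mean. Then

\[ Z = \sum_{i=1}^{n} (x_i - \bar{x})^2 = \sum_{i=1}^{n} x_i^2 - \frac{1}{n} \Bigl( \sum_{i=1}^{n} x_i \Bigr)^2 \]
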
The  latter  form  is  to  be  used  for  computing.  Draw  n  and  Z  axes  on  ordinary 
rectangular  graph  paper.  The  acceptance  and  rejection  lines  are  then 

\[ Z_1 = -h_1 + s(n - 1) \quad \text{(lower)} \]

\[ Z_2 = h_2 + s(n - 1) \quad \text{(upper)} \]

respectively. To find the intercepts -h_1 and h_2 and the common slope s,
we first compute the auxiliary quantities

\[ g = 0.43429 \left( \frac{1}{\sigma_1^2} - \frac{1}{\sigma_2^2} \right) \]

\[ a = \log_{10} \frac{1 - \beta}{\alpha} \]

\[ b = \log_{10} \frac{1 - \alpha}{\beta} \]

Then we have

\[ h_1 = \frac{2b}{g}, \qquad h_2 = \frac{2a}{g}, \qquad s = \frac{1}{g} \log_{10} \left( \frac{\sigma_2^2}{\sigma_1^2} \right) \]

Appendix Table 20 may be used to get a and b for common values of
α and β.

The chart for this technique, shown as Fig. 8.3, should be used in the same
manner as the chart for the proportion-defective test given in Sec. 8.4.1.
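
As a hedged illustration, the constants of this plan can be computed as follows. The expressions used for g, h_1, h_2, and s are the ones that follow from the sequential probability-ratio test for a normal variance with n - 1 degrees of freedom; they are stated here as an assumption, not as a transcription of the printed formulas, and all function names are hypothetical.

```python
import math

def sum_sq_dev(xs):
    """Z: sum of squared deviations from the sample mean (computing form)."""
    n = len(xs)
    return sum(x * x for x in xs) - sum(xs) ** 2 / n

def sigma_plan(sigma1, sigma2, alpha, beta):
    """Intercepts and slope of the lines Z = -h1 + s(n-1), Z = h2 + s(n-1)."""
    # 0.43429 is log10(e); g is assumed to be log10(e) * (1/sigma1^2 - 1/sigma2^2).
    g = 0.43429 * (1 / sigma1**2 - 1 / sigma2**2)
    a = math.log10((1 - beta) / alpha)
    b = math.log10((1 - alpha) / beta)
    h1 = 2 * b / g
    h2 = 2 * a / g
    s = math.log10(sigma2**2 / sigma1**2) / g
    return h1, h2, s

def decide(xs, h1, h2, s):
    """Compare Z for the first n observations with the two lines."""
    n = len(xs)
    z = sum_sq_dev(xs)
    if z <= -h1 + s * (n - 1):
        return "accept"
    if z >= h2 + s * (n - 1):
        return "reject"
    return "continue"
```

Under these assumptions the slope s always lies between σ_1² and σ_2², which is a convenient check on a hand computation.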


FIG. 8.3. Typical Sequential-Sampling Chart for Testing the Hypothesis That
σ = σ_1 Against the Alternative σ = σ_2 > σ_1.

APPENDIX TABLES

TABLE 1. Cumulative Normal Distribution*


\[ F(z) = \int_{-\infty}^{z} \frac{1}{\sqrt{2\pi}} e^{-z^2/2} \, dz \]


   z     .00     .01     .02     .03     .04     .05     .06     .07     .08     .09

   .0   .5000   .5040   .5080   .5120   .5160   .5199   .5239   .5279   .5319   .5359
   .1   .5398   .5438   .5478   .5517   .5557   .5596   .5636   .5675   .5714   .5753
   .2   .5793   .5832   .5871   .5910   .5948   .5987   .6026   .6064   .6103   .6141
   .3   .6179   .6217   .6255   .6293   .6331   .6368   .6406   .6443   .6480   .6517
   .4   .6554   .6591   .6628   .6664   .6700   .6736   .6772   .6808   .6844   .6879
   .5   .6915   .6950   .6985   .7019   .7054   .7088   .7123   .7157   .7190   .7224
   .6   .7257   .7291   .7324   .7357   .7389   .7422   .7454   .7486   .7517   .7549
   .7   .7580   .7611   .7642   .7673   .7704   .7734   .7764   .7794   .7823   .7852
   .8   .7881   .7910   .7939   .7967   .7995   .8023   .8051   .8078   .8106   .8133
   .9   .8159   .8186   .8212   .8238   .8264   .8289   .8315   .8340   .8365   .8389
  1.0   .8413   .8438   .8461   .8485   .8508   .8531   .8554   .8577   .8599   .8621
  1.1   .8643   .8665   .8686   .8708   .8729   .8749   .8770   .8790   .8810   .8830
  1.2   .8849   .8869   .8888   .8907   .8925   .8944   .8962   .8980   .8997   .9015
  1.3   .9032   .9049   .9066   .9082   .9099   .9115   .9131   .9147   .9162   .9177
  1.4   .9192   .9207   .9222   .9236   .9251   .9265   .9279   .9292   .9306   .9319
  1.5   .9332   .9345   .9357   .9370   .9382   .9394   .9406   .9418   .9429   .9441
  1.6   .9452   .9463   .9474   .9484   .9495   .9505   .9515   .9525   .9535   .9545
  1.7   .9554   .9564   .9573   .9582   .9591   .9599   .9608   .9616   .9625   .9633
  1.8   .9641   .9649   .9656   .9664   .9671   .9678   .9686   .9693   .9699   .9706
  1.9   .9713   .9719   .9726   .9732   .9738   .9744   .9750   .9756   .9761   .9767
  2.0   .9772   .9778   .9783   .9788   .9793   .9798   .9803   .9808   .9812   .9817
  2.1   .9821   .9826   .9830   .9834   .9838   .9842   .9846   .9850   .9854   .9857
  2.2   .9861   .9864   .9868   .9871   .9875   .9878   .9881   .9884   .9887   .9890
  2.3   .9893   .9896   .9898   .9901   .9904   .9906   .9909   .9911   .9913   .9916
  2.4   .9918   .9920   .9922   .9925   .9927   .9929   .9931   .9932   .9934   .9936
  2.5   .9938   .9940   .9941   .9943   .9945   .9946   .9948   .9949   .9951   .9952
  2.6   .9953   .9955   .9956   .9957   .9959   .9960   .9961   .9962   .9963   .9964
  2.7   .9965   .9966   .9967   .9968   .9969   .9970   .9971   .9972   .9973   .9974
  2.8   .9974   .9975   .9976   .9977   .9977   .9978   .9979   .9979   .9980   .9981
  2.9   .9981   .9982   .9982   .9983   .9984   .9984   .9985   .9985   .9986   .9986
  3.0   .9987   .9987   .9987   .9988   .9988   .9989   .9989   .9989   .9990   .9990
  3.1   .9990   .9991   .9991   .9991   .9992   .9992   .9992   .9992   .9993   .9993
  3.2   .9993   .9993   .9994   .9994   .9994   .9994   .9994   .9995   .9995   .9995
  3.3   .9995   .9995   .9995   .9996   .9996   .9996   .9996   .9996   .9996   .9997
  3.4   .9997   .9997   .9997   .9997   .9997   .9997   .9997   .9997   .9997   .9998

* Use explained in Sec. 1.2.1. For more extensive tables, see National Bureau of Standards,
Tables of Normal Probability Functions, Washington, U. S. Government Printing Office, 1953
(Applied Mathematics Series 23). Note that they show

\[ \int_{-z}^{z} f(z) \, dz, \quad \text{not} \quad \int_{-\infty}^{z} f(z) \, dz \]



TABLE 2. Percentiles of the Normal
Distribution*


* Use explained in Sec. 1.2.1. For a normally distributed variable x, we
have x = μ + zσ, where μ = mean of x and σ = standard deviation of x.
For more extensive tables, see R. A. Fisher and F. Yates, Statistical Tables,
4th rev. ed., Edinburgh, Oliver & Boyd, Ltd., 1953, pp. 39, 60-62.

TABLE 3. Upper Percentage Points of the t Distribution*


\[ P(t) = \int_{t}^{\infty} \frac{\Gamma\left(\frac{f+1}{2}\right)}{\sqrt{\pi f} \, \Gamma\left(\frac{f}{2}\right)} \left( 1 + \frac{t^2}{f} \right)^{-(f+1)/2} dt \]


  f \ P    .40     .30     .25     .20     .15     .10     .05     .025     .01     .005    .0005

    1     .325    .727   1.000   1.376   1.963   3.078   6.314  12.706  31.821  63.657  636.619
    2     .289    .617    .816   1.061   1.386   1.886   2.920   4.303   6.965   9.925   31.598
    3     .277    .584    .765    .978   1.250   1.638   2.353   3.182   4.541   5.841   12.924
    4     .271    .569    .741    .941   1.190   1.533   2.132   2.776   3.747   4.604    8.610
    5     .267    .559    .727    .920   1.156   1.476   2.015   2.571   3.365   4.032    6.869
    6     .265    .553    .718    .906   1.134   1.440   1.943   2.447   3.143   3.707    5.959
    7     .263    .549    .711    .896   1.119   1.415   1.895   2.365   2.998   3.499    5.408
    8     .262    .546    .706    .889   1.108   1.397   1.860   2.306   2.896   3.355    5.041
    9     .261    .543    .703    .883   1.100   1.383   1.833   2.262   2.821   3.250    4.781
   10     .260    .542    .700    .879   1.093   1.372   1.812   2.228   2.764   3.169    4.587
   11     .260    .540    .697    .876   1.088   1.363   1.796   2.201   2.718   3.106    4.437
   12     .259    .539    .695    .873   1.083   1.356   1.782   2.179   2.681   3.055    4.318
   13     .259    .538    .694    .870   1.079   1.350   1.771   2.160   2.650   3.012    4.221
   14     .258    .537    .692    .868   1.076   1.345   1.761   2.145   2.624   2.977    4.140
   15     .258    .536    .691    .866   1.074   1.341   1.753   2.131   2.602   2.947    4.073
   16     .258    .535    .690    .865   1.071   1.337   1.746   2.120   2.583   2.921    4.015
   17     .257    .534    .689    .863   1.069   1.333   1.740   2.110   2.567   2.898    3.965
   18     .257    .534    .688    .862   1.067   1.330   1.734   2.101   2.552   2.878    3.922
   19     .257    .533    .688    .861   1.066   1.328   1.729   2.093   2.539   2.861    3.883
   20     .257    .533    .687    .860   1.064   1.325   1.725   2.086   2.528   2.845    3.850
   21     .257    .532    .686    .859   1.063   1.323   1.721   2.080   2.518   2.831    3.819
   22     .256    .532    .686    .858   1.061   1.321   1.717   2.074   2.508   2.819    3.792
   23     .256    .532    .685    .858   1.060   1.319   1.714   2.069   2.500   2.807    3.767
   24     .256    .531    .685    .857   1.059   1.318   1.711   2.064   2.492   2.797    3.745
   25     .256    .531    .684    .856   1.058   1.316   1.708   2.060   2.485   2.787    3.725
   26     .256    .531    .684    .856   1.058   1.315   1.706   2.056   2.479   2.779    3.707
   27     .256    .531    .684    .855   1.057   1.314   1.703   2.052   2.473   2.771    3.690
   28     .256    .530    .683    .855   1.056   1.313   1.701   2.048   2.467   2.763    3.674
   29     .256    .530    .683    .854   1.055   1.311   1.699   2.045   2.462   2.756    3.659
   30     .256    .530    .683    .854   1.055   1.310   1.697   2.042   2.457   2.750    3.646
   40     .255    .529    .681    .851   1.050   1.303   1.684   2.021   2.423   2.704    3.551
   60     .254    .527    .679    .848   1.046   1.296   1.671   2.000   2.390   2.660    3.460
  120     .254    .526    .677    .845   1.041   1.289   1.658   1.980   2.358   2.617    3.373
    ∞     .253    .524    .674    .842   1.036   1.282   1.645   1.960   2.326   2.576    3.291

* Use explained in Sec. 2.2.2. For a two-sided (equal-tails) test the significance levels are twice
the above column headings. The reciprocals of the numbers of degrees of freedom, rather than
the numbers themselves, should be used for linear interpolation. To calculate the upper 0.5% point
for f = 34, use

\[ 2.704 + \frac{(1/34) - (1/40)}{(1/30) - (1/40)} (0.046) = 2.728 \]

which is correct to 3 decimal places, whereas ordinary linear interpolation would give 2.732. This
table was adapted, with the permission of the authors and the publishers, from R. A. Fisher and
F. Yates, Statistical Tables, 4th rev. ed., Edinburgh, Oliver & Boyd, Ltd., 1953, p. 40. (The original
table is given in terms of the two-sided test probabilities.)
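
The interpolation rule in this footnote is easy to mechanize. The sketch below is illustrative (the function names are not from the Manual); it interpolates linearly in 1/f between two tabled rows and, for comparison, linearly in f itself.

```python
def interpolate_reciprocal_df(f, f_lo, t_lo, f_hi, t_hi):
    """Interpolate a tabled percentage point linearly in 1/f.

    f_lo < f < f_hi are degrees of freedom; t_lo and t_hi are the
    tabled values at f_lo and f_hi.
    """
    weight = (1 / f - 1 / f_hi) / (1 / f_lo - 1 / f_hi)
    return t_hi + weight * (t_lo - t_hi)

def interpolate_linear_df(f, f_lo, t_lo, f_hi, t_hi):
    """Ordinary linear interpolation in f, shown only for comparison."""
    return t_lo + (f - f_lo) / (f_hi - f_lo) * (t_hi - t_lo)

# Upper 0.5% point for f = 34, from the rows f = 30 (2.750) and f = 40 (2.704).
print(round(interpolate_reciprocal_df(34, 30, 2.750, 40, 2.704), 3))
print(round(interpolate_linear_df(34, 30, 2.750, 40, 2.704), 3))
```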


TABLE 4. Upper Percentage Points of the χ² Distribution*

TABLE 7. Critical Absolute Values of Correlation Coefficient r*


* Use explained in Sec. 6.1.4a, 6.2.5b, and 6.2.5c. The inverse square roots of the numbers
of degrees of freedom, rather than the numbers themselves, should be used for linear
interpolation. Reproduced, with the permission of the author and the publisher, from G. W. Snedecor,
Statistical Methods, 4th ed., Ames, Iowa, Iowa State College Press, 1946, p. 351. Extensive
tables and charts appear in F. N. David, Tables of the Correlation Coefficient, London,
Biometrika Office, 1938.

TABLE 8. Factors for Determining Confidence Limits for
Population Standard Deviation σ*


f = n - 1 for single sample size n

f = n_1 + n_2 + ... + n_k - k for k sample sizes of n_1, ..., n_k


               90%                95%                99%
    f      b_1     b_2        b_1     b_2        b_1      b_2

    1     .510   15.947      .446   31.910      .356   159.576
    2     .578    4.415      .521    6.285      .434    14.124
    3     .620    2.920      .566    3.729      .483     6.468
    4     .649    2.372      .599    2.874      .519     4.396
    5     .672    2.089      .624    2.453      .546     3.485
    6     .690    1.915      .644    2.202      .569     2.980
    7     .705    1.797      .661    2.035      .588     2.660
    8     .718    1.711      .675    1.916      .604     2.439
    9     .729    1.645      .688    1.826      .618     2.278
   10     .739    1.593      .699    1.755      .630     2.154
   11     .748    1.551      .708    1.698      .641     2.056
   12     .755    1.515      .717    1.651      .651     1.976
   13     .762    1.485      .725    1.611      .660     1.910
   14     .769    1.460      .732    1.577      .669     1.854
   15     .775    1.437      .739    1.548      .676     1.806
   16     .780    1.418      .745    1.522      .683     1.764
   17     .785    1.400      .750    1.499      .690     1.727
   18     .790    1.384      .756    1.479      .696     1.695
   19     .794    1.370      .760    1.461      .702     1.668
   20     .798    1.358      .765    1.444      .707     1.640
   21     .802    1.346      .769    1.429      .712     1.617
   22     .805    1.335      .773    1.415      .717     1.595
   23     .809    1.326      .777    1.403      .722     1.576
   24     .812    1.316      .781    1.391      .726     1.558
   25     .815    1.308      .784    1.380      .730     1.542
   26     .818    1.300      .788    1.370      .734     1.526
   27     .820    1.293      .791    1.361      .737     1.512
   28     .823    1.286      .794    1.352      .741     1.499
   29     .825    1.280      .796    1.344      .744     1.487
   30     .828    1.274      .799    1.337      .748     1.475
   40     .847    1.228      .821    1.280      .774     1.390
   50     .861    1.199      .837    1.243      .793     1.337
   60     .871    1.179      .849    1.217      .808     1.299
   70     .879    1.163      .858    1.198      .820     1.272
   80     .886    1.151      .866    1.183      .829     1.250
   90     .892    1.141      .873    1.171      .838     1.233
  100     .897    1.133      .879    1.161      .845     1.219

 >100   1/(1 ± 1.645/√(2f))   1/(1 ± 1.960/√(2f))   1/(1 ± 2.576/√(2f))

* Use explained in Sec. 3.2.3. To obtain the limits, multiply the sample standard deviation s
by b_1 and by b_2. This table was prepared by Eleanor G. Crow, NOTS.
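
For more than 100 degrees of freedom the last row of the table appears to give the factors as 1/(1 ± z/√(2f)), with z the normal deviate for the chosen confidence level. The sketch below rests on that reading, taking the plus sign for b_1 and the minus sign for b_2; the function names are illustrative.

```python
import math

# Normal deviates for the three confidence levels covered by the table.
NORMAL_DEVIATE = {0.90: 1.645, 0.95: 1.960, 0.99: 2.576}

def sigma_limit_factors(f, confidence=0.95):
    """Approximate factors (b1, b2) for f degrees of freedom, intended for f > 100."""
    r = NORMAL_DEVIATE[confidence] / math.sqrt(2 * f)
    return 1 / (1 + r), 1 / (1 - r)

def sigma_confidence_limits(s, f, confidence=0.95):
    """Confidence limits for sigma: the sample standard deviation s times b1 and b2."""
    b1, b2 = sigma_limit_factors(f, confidence)
    return s * b1, s * b2

# At f = 100 the approximation can be compared with the tabled .897 and 1.133.
print(sigma_limit_factors(100, 0.90))
```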
TABLE 9. Critical Values for the Sign Test*


N.  a 

-.01 

.05 

.10 

.25 

Nw  a 

r\ 

.01 

.05 

.10 

.25 

i 

46 

■9 

15 

16 

18 

2 

.... 

47 

bk 

16 

17 

19 

3 

0 

48 

HJ9 

16 

17 

19 

4 

0 

49 

Hi 

17 

18 

19 

5 

0 

0 

50 

KB 

17 

18 

20 

6 

0 

0 

1 

51 

18 

19 

20 

7 

0 

0 

1 

52 

Bn 

18 

19 

21 

8 

0 

0 

l 

1 

53 

Bn 

18 

20 

21 

9 

0 

1 

l 

2 

54 

Bn 

19 

20 

22 

0 

1 

l 

2 

55 

KB 

19 

20 

22 

11 

0 

1 

2 

3 

56 

17 

20 

21 

23 

12 

1 

2 

2 

3 

57 

18 

20 

21 

23 

13 

1 

2 

3 

3 

58 

18 

21 

22 

24 

14 

1 

2 

3 

4 

59 

19 

21 

22 

24 

15 

2 

3 

3 

4 

60 

19 

21 

23 

25 

16 

2 

3 

4 

5 

61 

20 

22 

23 

25 

17 

2 

4 

4 

5 

62 

20 

22 

24 

25 

18 

3 

4 

5 

6 

63 

20 

23 

24 

26 

19 

3 

4 

5 

6 

64 

21 

23 

24 

26 

20 

3 

5 

5 

6 

65 

21 

24 

25 

27 

21 

4 

5 

6 

7 

66 

22 

24 

25 

27 

22 

4 

5 

6 

7 

67 

22 

25 

26 

28 

23 

4 

6 

7 

8 

68 

22 

25 

26 

28 

24 

5 

6 

7 

8 

69 

23 

25 

27 

29 

25 

5 

7 

7 

9 

70 

23 

26 

27 

29 

26 

6 

7 

8 

9 

71 

24 

26 

28 

30 

27 

6 

7 

8 

10 

72 

24 

27 

28 

30 

28 

6 

8 

9 

10 

73 

25 

27 

28 

31 

29 

7 

8 

9 

10 

74 

25 

28 

29 

31 

-10 

7 

9 

10 

11 

75 

25 

28 

29 

32 

I >1 

7 

9 

mm 

11 

76 

26 

28 

30 

32 

32 

8 

9 

■9 

12 

77 

26 

29 

30 

32 

33 

8 

ii 

12 

78 

27 

29 

31 

33 

34 

9 

Bn 

ii 

13 

79 

27 

30 

31 

33 

35 

9 

ii 

12 

13 

80 

28 

30 

32 

34 

36 

9 

u 

12 

14 

81 

28 

31 

32 

34 

37 

12 

13 

14 

82 

28 

31 

33 

35 

38 

12 

13 

14 

83 

29 

32 

33 

35 

39 

11 

12 

13 

15 

84 

29 

32 

33 

36 

40 

13 

14 

15 

85 

30 

32 

34 

36 

41 

mm 

n 

14 

16 

86 

30 

33 

34 

37 

42 

m  B1 

12 

15 

16 

87 

31 

33 

35 

37 

43 

KB 

15 

17 

88 

31 

34 

35 

38 

44 

■9 

Bn 

16 

17 

89 

31 

34 

36 

38 

45 

III 

kI 

16 

18 

90 

32 

35 

36 

39 

* Use explained in Sec. 2.5.2a. In the sign test the test statistic is the number of + signs
or the number of - signs, whichever is smaller. A statistic less than or equal to the critical
value tabled under α will occur with probability not more than α. For values of N larger
than 90, approximate critical values may be found by taking the nearest integer less than
(N - 1)/2 - k√(N + 1), where k is 1.2879, 0.9800, 0.8224, and 0.5752 for the 1, 5, 10,
and 25% values, respectively. Reproduced, with the permission of the authors and the
publisher, from W. J. Dixon and F. J. Massey, Jr., Introduction to Statistical Analysis, New
York, McGraw-Hill, 1951, p. 324.
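
Critical values of this kind can also be computed directly. The sketch below assumes that the tabled value is the largest r for which the smaller of the two sign counts is at most r with probability not exceeding α, that is, 2 P(X ≤ r) ≤ α for X binomial with parameters N and 1/2; it also implements the large-N approximation quoted in the footnote. Function names are illustrative.

```python
import math

def sign_test_critical_exact(n, alpha):
    """Largest r with 2 * P(X <= r) <= alpha for X ~ Binomial(n, 1/2).

    Returns None when even r = 0 is too probable.
    """
    total = 2 ** n
    cumulative = 0
    critical = None
    for r in range(n + 1):
        cumulative += math.comb(n, r)
        if 2 * cumulative <= alpha * total:
            critical = r
        else:
            break
    return critical

# Multipliers k from the footnote, keyed by alpha.
K_FACTOR = {0.01: 1.2879, 0.05: 0.9800, 0.10: 0.8224, 0.25: 0.5752}

def sign_test_critical_approx(n, alpha):
    """Footnote approximation (meant for N > 90): integer part of (N-1)/2 - k*sqrt(N+1)."""
    return math.floor((n - 1) / 2 - K_FACTOR[alpha] * math.sqrt(n + 1))

print([sign_test_critical_exact(20, a) for a in (0.01, 0.05, 0.10, 0.25)])
print([sign_test_critical_approx(120, a) for a in (0.01, 0.05, 0.10, 0.25)])
```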


n 


1 


j 


*  l 


5  C 

6  6 


5  6 

5  0 

5  6 

8  6 


7  8  8  9  9  10  10  11  . 

7  8  9  9  10  10  11  11  11  . . 

8  8  9  9  10  10  11  11  12  12  . 

8  8  9  10  10  11  11  12  12  13  13  .... 

8  9  9  10  10  11  12  12  13  13  13  14 


m  —  m 

0.875 

ni  —  ns 

0.875 

m  —  ns 

0.875 

21 

15 

32 

24 

55 

45 

22 

16 

34 

26 

60 

49 

23 

16 

36 

28 

65 

54 

24 

17 

38 

30 

70 

58 

25 

18 

40 

31 

75 

63 

20 

19 

42 

33 

80 

68 

27 

20 

44 

35 

85 

72 

28 

21 

46 

37 

90 

77 

29 

22 

48 

38 

95 

82 

30 

22 

50 

40 

100 

86 

* Use explained in Sec. 4.1 and 4.7.2. The values listed are such that a number less than or equal
to v.975 will occur with probability not more than 2.5%; a number greater than or equal to v.025
will occur with probability not more than 2.5%. A number less than or equal to v.95 will occur with
probability not more than 5%; a number greater than or equal to v.05 will occur with probability
not more than 5%. Adapted, with the permission of the authors and the editor, from C. Eisenhart
and F. Swed, “Tables for Testing Randomness of Grouping in a Sequence of Alternatives,” ANN
MATH STAT, Vol. 14 (1943), pp. 83-86.


TABLE 10. (Contd.)

V.ms 


l 

ni  —  nt 

0.025 

ni  —  ni 

O.025 

ni  =  uj 

t),025 

i 

21 

29 

32 

42 

55 

67 

* 

22 

30 

34 

44 

60 

73 

» 

23 

32 

36 

16 

65 

78 

* 

24 

33 

38 

48 

70 

84 

w 

25 

34 

40 

51 

75 

89 

i 

26 

35 

42 

53 

80 

94 

\ 

27 

36 

44 

55 

85 

100 

X 

28 

37 

46 

57 

90 

105 

29 

38 

48 

60 

95 

110 

> 

f 

30 

40 

50 

62 

100 

116 

TABLE  10.  (Contd.) 


Pm 


X 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12  13 

14 

15 

16 

17 

18 

19 

20 

2 

3 

4 

7 

8 

..... 

5 

9 

9 

.... 

.... 

.... 

.... 

. 

.... 

8 

9 

10 

11 

7 

9 

10 

11 

12 

..... 

8 

11 

12 

13 

13 

9 

11 

12 

13 

14 

14 

.... 

11 

12 

13 

14 

15 

18 

....  .... 

.... 

... 

..... 

11 

....  13 

14 

15 

15 

16 

17 

12 

....  13 

14 

15 

18 

17 

17 

18 

13 

....  13 

14 

15 

16 

17 

18 

18 

19 

14 

....  13 

14 

16 

17 

17 

18 

19 

20 

20 

15 

15 

16 

17 

18 

19 

19 

20 

21 

21 

.... 

.... 

18 

15 

16 

17 

18 

19 

20 

21 

21 

22 

23 

17 

....  .... 

15 

18 

17 

18 

19 

20 

21 

22 

22 

23 

24 

18 

15 

16 

18 

19 

20 

21 

21 

22 

23 

24 

24 

25 

19 

....  ....  .... 

15 

16 

18 

19 

20 

21 

22 

23 

23 

24 

25 

25 

26 

17 

18 

19 

20 

21 

22 

23 

24 

25 

25 

26 

27 

27 

ni  —  m 

U.OJ 

ni  —  n> 

V.05 

m  —  m 

t>.0S 

t 

21 

28 

32 

41 

55 

66 

t 

5 

22 

29 

34 

43 

60 

71 

23 

31 

36 

45 

65 

76 

24 

32 

38 

47 

70 

82 

25 

33 

40 

49 

75 

87 

* 

26 

34 

42 

51 

80 

92 

27 

35 

44 

54 

85 

98 

< 

28 

38 

46 

56 

90 

103 

) 

29 

37 

48 

58 

95 

108 

' 

38 

50 

60 

100 

114 

TABLE 11. Half-Widths d_α for Construction of
Confidence Bands for Cumulative Distributions*


  n \ α     .20        .15        .10        .05        .01

    5       .45        .47        .51        .56        .67
   10       .32        .34        .37        .41        .49
   20       .23        .25        .26        .29        .36
   25       .21        .22        .24        .27        .32
   30       .19        .20        .22        .24        .29
   35       .18        .19        .20        .23        .27
   40       .17        .18        .19        .21        .25
   45       .16        .17        .18        .20        .24
   50       .15        .16        .17        .19        .23

  >50     1.07/√n    1.14/√n    1.22/√n    1.36/√n    1.63/√n

* Use explained in Sec. 4.4. Reproduced, with the permission of the
authors and the publisher, from W. J. Dixon and F. J. Massey, Jr.,
Introduction to Statistical Analysis, New York, McGraw-Hill, 1951, p. 348.
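
For samples larger than 50 the last row of the table gives the half-width as a constant divided by √n. A minimal sketch of how a band might be built from it follows; it assumes the band is the sample cumulative distribution plus and minus d_α, clipped to the interval from 0 to 1, and the function names are illustrative.

```python
import math

# Numerators of the ">50" row, keyed by alpha.
LARGE_SAMPLE_COEFF = {0.20: 1.07, 0.15: 1.14, 0.10: 1.22, 0.05: 1.36, 0.01: 1.63}

def band_half_width(n, alpha):
    """Half-width d_alpha for n > 50; smaller n should be read from the table."""
    if n <= 50:
        raise ValueError("for n <= 50 read d_alpha from the table instead")
    return LARGE_SAMPLE_COEFF[alpha] / math.sqrt(n)

def confidence_band(sample, alpha):
    """Return (x, lower, upper) at each ordered observation."""
    xs = sorted(sample)
    n = len(xs)
    d = band_half_width(n, alpha)
    band = []
    for i, x in enumerate(xs, start=1):
        f_n = i / n  # sample cumulative distribution at x
        band.append((x, max(0.0, f_n - d), min(1.0, f_n + d)))
    return band
```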


TABLE 12. Factor k for Estimating Normal
Standard Deviation From the Range w as kw*


     n       k          n       k

     2     .886         8     .351
     3     .591         9     .337
     4     .486        10     .325
     5     .430        50     .222
     6     .395       100     .199
     7     .370      1000     .154

* Use explained in Sec. 1.1.4b. The factors for n = 2, 3, ..., 10 are
reproduced, with the permission of the authors and the publisher, from
W. J. Dixon and F. J. Massey, Jr., Introduction to Statistical Analysis,
New York, McGraw-Hill, 1951, p. 315. The factors for n = 50, 100, and
1000 are reciprocals of entries in a table of mean ranges complete to
n = 1000 in L. H. C. Tippett, “On the Extreme Individuals and the
Range of Samples Taken From a Normal Population,” BIOMETRIKA,
Vol. 17 (1925), pp. 364-87.
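
As a small illustration of the use of this table, the estimate kw can be computed as below. The dictionary simply restates the tabled factors; the function name is hypothetical, and sample sizes not in the table are rejected, not interpolated.

```python
# Factors k from Table 12, keyed by sample size n.
RANGE_FACTOR = {
    2: 0.886, 3: 0.591, 4: 0.486, 5: 0.430, 6: 0.395, 7: 0.370,
    8: 0.351, 9: 0.337, 10: 0.325, 50: 0.222, 100: 0.199, 1000: 0.154,
}

def sigma_from_range(sample):
    """Estimate the normal standard deviation as k times the range w."""
    n = len(sample)
    if n not in RANGE_FACTOR:
        raise ValueError(f"no tabled factor for n = {n}")
    w = max(sample) - min(sample)
    return RANGE_FACTOR[n] * w

# Five observations with range 4.0 give 0.430 * 4.0.
print(sigma_from_range([10.0, 12.0, 11.0, 14.0, 13.0]))
```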
TABLE 13. Critical Values for Tests Using
the Range w*


(a)  Critical  Values  for  n  —  - — - 

XV 


n 


.05 

.025 

.01 

.005 

3.16 

6.35 

15.91 

31.83 

2.11 

.529 

1.32 

.388 

■3ili 

Mil ; 

.312 

.399 

.523 

.628 

.263 

.429 

.507 

I'jpf 

.288 

.366 

.429 

.255 

.322 

.374 

.186 

.288 

(b)  Critical  Values  for  n  -  — L- — 1 

W\  -j-  Wz 


a 

ni  —  nN^ 

.05 

.025 

.01 

.005 

2  ! 

1.10 

1.71 

2.78 

3.96 

3 

0.487 

0.638 

0.857 

1.05 

4 

.322 

.406 

.524 

0.618 

5 

.246 

.306 

.386 

.448 

8 

.203 

,250 

.310 

.357 

7 

.173 

.213 

.263 

.300 

8 

.153 

.186 

.229 

.260 

9 

.137 

.167 

.204 

.232 

10 

.125 

.152 

.185 

.210 

* Use explained in Sec. 2.2.1 and 2.5.3a respectively. Adapted, with the
permission of the author and the publisher, from E. Lord, “The Use of
the Range in Place of the Standard Deviation in the t Test,” BIOMETRIKA,
Vol. 34 (1947), p. 41.

TABLE 14. Critical Values for δ²/s²*


For P = .50, δ²/s² = 2 for all n


  n \ P     .999      .99       .95       .05       .01      .001

    4      .5898     .6256     .7805    3.2195    3.3744    3.4102
    5      .4161     .5379     .8204    3.1796    3.4621    3.5839
    6      .3634     .5615     .8902    3.1098    3.4385    3.6366
    7      .3695     .6140     .9359    3.0641    3.3860    3.6305
    8      .4036     .6628     .9825    3.0175    3.3372    3.5964
    9      .4420     .7088    1.0244    2.9756    3.2911    3.5580
   10      .4816     .7518    1.0623    2.9378    3.2482    3.5184
   11      .5197     .7915    1.0965    2.9035    3.2085    3.4803
   12      .5557     .8280    1.1276    2.8724    3.1720    3.4443
   13      .5898     .8618    1.1558    2.8442    3.1381    3.4102
   14      .6223     .8931    1.1816    2.8184    3.1068    3.3777
   15      .6532     .9221    1.2053    2.7947    3.0778    3.3467
   16      .6826     .9491    1.2272    2.7728    3.0509    3.3174
   17      .7104     .9743    1.2473    2.7527    3.0257    3.2896
   18      .7368     .9979    1.2660    2.7340    3.0021    3.2632
   19      .7617    1.0199    1.2834    2.7166    2.9800    3.2383
   20      .7852    1.0406    1.2996    2.7004    2.9593    3.2148
   21      .8073    1.0601    1.3148    2.6852    2.9399    3.1927
   22      .8283    1.0785    1.3290    2.6710    2.9216    3.1718
   23      .8481    1.0958    1.3425    2.6576    2.9042    3.1520
   24      .8668    1.1122    1.3552    2.6449    2.8877    3.1333
   25      .8846    1.1278    1.3671    2.6329    2.8722    3.1154
   26      .9017    1.1426    1.3785    2.6215    2.8575    3.0983
   27      .9182    1.1567    1.3892    2.6108    2.8434    3.0818
   28      .9341    1.1702    1.3994    2.6006    2.8300    3.0659
   29      .9496    1.1830    1.4091    2.5909    2.8171    3.0505
   30      .9645    1.1951    1.4183    2.5817    2.8049    3.0355
   31      .9789    1.2067    1.4270    2.5729    2.7933    3.0212
   32      .9925    1.2177    1.4354    2.5646    2.7823    3.0076
   33     1.0055    1.2283    1.4434    2.5566    2.7717    2.9946
   34     1.0180    1.2386    1.4511    2.5490    2.7614    2.9821
   35     1.0300    1.2485    1.4585    2.5416    2.7515    2.9701
   36     1.0416    1.2581    1.4656    2.5344    2.7419    2.9584
   37     1.0529    1.2673    1.4726    2.5274    2.7325    2.9469
   38     1.0639    1.2763    1.4793    2.5208    2.7237    2.9360
   39     1.0746    1.2850    1.4858    2.5142    2.7151    2.9254
   40     1.0850    1.2934    1.4921    2.5079    2.7066    2.9151
   41     1.0950    1.3017    1.4982    2.5018    2.6983    2.9050
   42     1.1048    1.3096    1.5041    2.4958    2.6904    2.8952
   43     1.1142    1.3172    1.5098    2.4901    2.6827    2.8858
   44     1.1233    1.3246    1.5154    2.4846    2.6754    2.8767
   45     1.1320    1.3317    1.5206    2.4794    2.6683    2.8680
   46     1.1404    1.3387    1.5257    2.4743    2.6614    2.8596
   47     1.1484    1.3453    1.5305    2.4695    2.6548    2.8516
   48     1.1561    1.3515    1.5351    2.4649    2.6486    2.8439
   49     1.1635    1.3573    1.5395    2.4604    2.6426    2.8365
   50     1.1705    1.3629    1.5437    2.4563    2.6370    2.8295
   51     1.1774    1.3683    1.5477    2.4522    2.6316    2.8225
   52     1.1843    1.3738    1.5518    2.4483    2.6262    2.8157
   53     1.1910    1.3792    1.5557    2.4444    2.6208    2.8090
   54     1.1976    1.3846    1.5596    2.4405    2.6154    2.8024
   55     1.2041    1.3899    1.5634    2.4368    2.6102    2.7959
   56     1.2104    1.3949    1.5670    2.4330    2.6050    2.7896
   57     1.2166    1.3999    1.5707    2.4294    2.6001    2.7834
   58     1.2227    1.4048    1.5743    2.4258    2.5952    2.7773
   59     1.2288    1.4096    1.5779    2.4222    2.5903    2.7712
   60     1.2349    1.4144    1.5814    2.4186    2.5856    2.7651

* Use explained in Sec. 2.8.2. From a paper by A. H. J. Baines, “Methods of Detecting
Non-Randomness in a Given Series of Observations,” Technical Report - Series “R,”
No. Q.C./R/12, British Ministry of Supply, as reproduced in Frankford Arsenal, Statistical
Manual: Methods of Making Experimental Inferences, 2nd rev. ed., by C. W. Churchman,
Philadelphia, Frankford Arsenal, June 1951, Table VIII.


TABLE 15. Orthogonal Polynomials*

TABLE 16. Critical Values for Test Ratios for Gross Errors*


Ratio 

Number  n 

Critical  values 

of  observations 

a  =  .10 

a  =  .05 

a  =  .01 

3 

.941 

.970 

.994 

XI  —  Xi 

4 

.765 

.829 

.926 

no  —  1  . — 

X*  —  X| 

5 

.642 

.710 

.821 

6 

.560 

.628 

.740 

7 

.507 

.569 

.680 

8 

.544 

.608 

.717 

i! 

H 

u» 

1 

H 

9 

.503 

.564 

.672 

10 

.470 

.530 

.635 

X»-l  —  Xi 

11 

.445 

.502 

.605 

12 

.423 

.479 

.579 

13 

.563 

.611 

.697 

_  Xi  —  X\ 

rn  — 

14 

.539 

.586 

.670 

15 

.518 

.565 

.647 

*«-l  —  Xj 

16 

.500 

.546 

.627 

17 

.483 

.529 

.810 

18 

.469 

.514 

.594 

19 

.457 

.501 

.580 

20 

.446 

.489 

.567 

21 

.435 

.478 

.555 

22 

.426 

.468 

.544 

23 

.418 

.459 

.535 

24 

.410 

.451 

.526 

25 

.402 

.443 

.517 

26 

.396 

.436 

.510 

27 

.389 

.429 

.502 

28 

.383 

.423 

.495 

29 

.378 

.417 

.489 

30 

.373 

.412 

.483 

•  Use  explained  in  Sec.  4.8.  Values  for  a  =  .01  and  .10,  n  =:  3,  4,  5,  6,  and  7  are 
adapted  from  W,  I.  Dixon,  “Ratios  Involving  Extreme  Values,"  ANN  MATH  STAT,  Vol. 
22  (1951),  pp.  68-78.  Remaining  values  calculated  by  R.  S.  Gardner,  NOTS.  The  above 
values  apply  when  deviations  at  either  end  of  the  sample  are  of  interest,  and  it  is  not 
specified  before  seeing  the  data  wL'ch  end  it  may  be;  Dixon’s  tables  apply  when  it  is  known 
before  seeing  the  data  which  end  of  the  sample  is  of  interest. 
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TABLE  17.  Tolerance  Factors  K  for  Population 
Proportion  P  of  Normal  Distributions* 


tt  - 

0.10 

(t  - 

0.05 

rt  — 

0.10 

a  — 

0.05 

\  p 
n  \ 

0.90 

0.95 

0.90 

0.95 

n 

0.90 

0.95 

0.90 

0.95 

55 

1.901 

2.265 

1.976 

2.354 

2 

15.978 

18.800 

32.019 

37.674 

60 

1.887 

2.248 

1.958 

2.333 

3 

5.847 

6.919 

8.380 

9.916 

65 

1.875 

2.235 

1.943 

2.315 

4 

4.166 

4.943 

5.369 

6.370 

70 

1.865 

2.222 

1.929 

2.299 

5 

3.494 

4.152 

4.275 

5.079 

75 

1.856 

2.211 

1.917 

2.285 

6 

3.131 

3.723 

3.712 

4.414 

80 

1.848 

2.202 

1.907 

2.272 

7 

2.902 

3.452 

3.369 

4.007 

85 

1.841 

2.193 

1.897 

2.261 

8 

2.743 

3.264 

3.136 

3.732 

90 

1.834 

2.185 

1.889 

2.251 

9 

2.626 

3.125 

2.967 

3.532 

95 

1.828 

2.178 

1.881 

2.241 

10 

2.535 

3.018 

2.839 

3.379 

100 

1.822 

2.172 

1.874 

2.233 

11 

2.463 

2.933 

2.737 

3.259 

110 

1.813 

2.160 

1.861 

2.218 

12 

2.404 

2.863 

2.655 

3.162 

120 

1.804 

2.150 

1.850 

2.205 

13 

2.355 

2.805 

2.587 

3.081 

130 

1.797 

2.141 

1.841 

2.194 

14 

2.314 

2.756 

2.529 

3.012 

140 

1.791 

2.134 

1.833 

2.184 

15 

2.278 

2.713 

2.480 

2.954 

150 

1.785 

2.127 

1.825 

2.175 

16 

2.246 

2.676 

2.437 

2.903 

160 

1.780 

2.121 

1.819 

2.167 

17 

2.219 

2.643 

2.400 

2.858 

170 

1.775 

2.116 

J  .813 

2.160 

18 

2.194 

2.614 

2.366 

2.819 

180 

1.771 

2.111 

1.808 

2.154 

19 

2.172 

2.588 

2.337 

2.784 

190 

1.767 

2.106 

1.803 

2.148 

20 

2.152 

2.564 

2.310 

2.752 

200 

1.764 

2.102 

1.798 

2.143 

21 

2.135 

2.543 

2.288 

2.723 

225 

1.756 

2.093 

1.788 

2.131 

22 

2.118 

2.524 

2.264 

2.697 

250 

1.750 

2.085 

1.780 

2.121 

23 

2.103 

2.506 

2.244 

2.673 

275 

1.745 

2.079 

1.773 

2.113 

24 

2.089 

2.489 

2.225 

2.651 

300 

1.740 

2.073 

1.767 

2.106 

25 

2.077 

2.474 

2.208 

2.631 

26 

2.065 

2.460 

2.193 

2.612 

350 

1.732 

2.064 

1.757 

2.094 

27 

2.054 

2.447 

2.178 

2.595 

400 

1.726 

2.057 

1.749 

2.084 

28 

2.044 

2.435 

2.164 

2.579 

450 

1.721 

2.051 

1.743 

2.077 

29 

2.034 

2.424 

2.152 

2.564 

500 

1.717 

2.046 

1.737 

2.070 

30 

2.025 

2.413 

2.140 

2.549 

31 

2.017 

2.403 

2.129 

2.536 

550 

1.713 

2.041 

1.733 

2.065 

32 

2.009 

2.393 

2.118 

2.524 

600 

1.710 

2.038 

1.729 

2.060 

33 

2.001 

2.385 

2.108 

2.512 

050 

1.707 

2.034 

1.725 

2.056 

34 

1.994 

2.376 

2.099 

2.501 

700 

1.705 

2.032 

1.722 

2.052 

35 

1.988 

2.368 

2.090 

2.490 

750 

1.703 

2.029 

i.719 

2.049 

36 

1.981 

2.361 

2.081 

2.479 

800 

1.701 

2.027 

1.717 

2.046 

37 

1.975 

2.353 

2.073 

2.470 

850 

1.699 

2.025 

1.714 

2.043 

38 

1.969 

2.346 

2.068 

2.464 

900 

1.697 

2.023 

1.712 

2.040 

39 

1.964 

2.340 

2.060 

2.455 

950 

1.696 

2.021 

1.710 

2.038 

40 

1.959 

2.334 

2.052 

2.445 

1000 

1.695 

2.019 

1.709 

2.036 

45 

1.935 

2.306 

2.021 

2.408 

00 

1.645 

1.960 

1.645 

1.960 

50 

1.916 

2.284 

1.996 

2.379 

* Use explained in Sec. 4.9. Abridged, with the permission of the author and the publisher,
from A. H. Bowker, “Tolerance Limits for Normal Distributions,” in Selected Techniques of
Statistical Analysis, ed. by Eisenhart, Hastay, and Wallis, New York, McGraw-Hill, 1947,
pp. 97-110. The original reference contains five P values for each of four a values and
includes more n values.
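The last row of the table (n = ∞) can be checked independently. When the mean and standard deviation are known exactly, the two-sided tolerance factor for a proportion P reduces to the standard normal deviate that leaves (1 − P)/2 in each tail, which gives the tabled 1.645 and 1.960 for P = 0.90 and P = 0.95. The sketch below is only an illustration of that limiting case; the function name is ours and does not appear in the Manual.

```python
from statistics import NormalDist


def limiting_tolerance_factor(P: float) -> float:
    """Two-sided tolerance factor K as n -> infinity.

    With the population mean and standard deviation known, the interval
    mean +/- K * sigma covers a proportion P of a normal population when
    K is the standard normal quantile at (1 + P) / 2.
    """
    return NormalDist().inv_cdf((1.0 + P) / 2.0)


if __name__ == "__main__":
    for P in (0.90, 0.95):
        print(f"P = {P:.2f}: K = {limiting_tolerance_factor(P):.3f}")
```

For finite n the factors in the body of the table are larger than these limits, because the sample mean and standard deviation are themselves uncertain.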


TABLE  18.  Factors  for  Computing  Control-Chart  Lines 
TABLE 19. Constants for a Sampling Plan With OC Curve
Through (p1, 0.95) and (p2, 0.05)*


(a)  Single  Sampling  Plan 


i 


1 

Operating  ratio 

R0 = p2/p1

— 

Aci 

Re  i 

Aci 

nipi 

1 

15.1 

0 

2 

1 

0.207 

8.3 

0 

3 

2 

* 

5.1 

1 

4 

4 

i 

4.1 

2 

5 

6 

1.63 

t 

3.5 

2 

6 

7 

1.99 

3 

8 

9 

2.77 

2.6 

5 

12 

13 

4.34 

i 

? 

2.3 

6 

14 

16 

5.51 

2.02 

9 

18 

23 

8.38 

1.82 

13 

24 

32 

12.19 

* 

1.61 

21 

35 

50 

1.505 

30 

46 

69 

28.53 

f 

? 

1.338 

63 

84 

138 

60.31 

* Use explained in Sec. 8.3. Reproduced with the permission of the author and the publishers
from Paul Peach, An Introduction to Industrial Statistics and Quality Control, 2nd ed.,
Raleigh, N. C., Edwards & Broughton Co., 1947, pp. 228-29.
TABLE 20. Table of a = log10[(1 − β)/α] and b = log10[(1 − α)/β]
The upper number in each cell represents a, the lower number, b.


 β \ α     0.001    0.010    0.050    0.100    0.200

 0.001     3.000    2.000    1.301    1.000    0.699
           3.000    2.996    2.978    2.954    2.903

 0.010     2.996    1.996    1.297    0.996    0.695
           2.000    1.996    1.978    1.954    1.903

 0.050     2.978    1.978    1.279    0.978    0.677
           1.301    1.297    1.279    1.255    1.204

 0.100     2.954    1.954    1.255    0.954    0.653
           1.000    0.996    0.978    0.954    0.903

 0.200     2.903    1.903    1.204    0.903    0.602
           0.699    0.695    0.677    0.653    0.602
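The entries of Table 20 can be regenerated for values of α and β that are not tabled. The sketch below assumes the title is read as a = log10[(1 − β)/α] and b = log10[(1 − α)/β], with α running across the columns and β down the rows; that reading reproduces the printed entries, but the function name and layout are ours.

```python
import math


def table20_constants(alpha: float, beta: float) -> tuple[float, float]:
    """Return (a, b) under the assumed definitions
    a = log10((1 - beta) / alpha) and b = log10((1 - alpha) / beta)."""
    a = math.log10((1.0 - beta) / alpha)
    b = math.log10((1.0 - alpha) / beta)
    return a, b


if __name__ == "__main__":
    levels = (0.001, 0.010, 0.050, 0.100, 0.200)
    for beta in levels:
        # Upper line of each cell is a, lower line is b, as in the table.
        a_row = [table20_constants(alpha, beta)[0] for alpha in levels]
        b_row = [table20_constants(alpha, beta)[1] for alpha in levels]
        print(f"beta = {beta:.3f}  a:", "  ".join(f"{v:.3f}" for v in a_row))
        print("               b:", "  ".join(f"{v:.3f}" for v in b_row))
```

For example, α = 0.050 and β = 0.100 give a ≈ 1.255 and b ≈ 0.978, the pair printed in the corresponding cell.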
TABLE 21. Confidence Limits for a Proportion


(a)  Two-Sided  Limits* 

Upper  limits  are  in  boldface.  The  observed  proportion  in  a  random  sample  is  r/n. 


r 

90% 

95% 

99% 

r 

90% 

95% 

99% 

n  = 

1 

n  =  2 

0 

0  .900 

0 

.950 

0 

.990 

0 

0  .684 

0  .776 

0 

.900 

1 

.100  1 

.050 

1 

.010 

1 

1 

.051  .949 

.025+  .975- 

.005+ 

.995- 

2 

.316  1 

.224  1 

.100 

1 

n = 3

n = 4

0 

0  .536

0 

.632 

0 

.785- 

0 

0  .500 

0  .527 

0 

.684 

1 

.035-  .804 

.017 

.865- 

.003 

.941 

1 

.026  .680 

.013  .751 

.003 

.859 

2 

.196  .965+ 

.135+ 

.983 

.059 

.997 

2 

.143  .857 

.098  .902 

.042 

.958 

3 

.464  1 

.368 

1 

.215+ 

1 

3 

.320  .974 

.249  .987 

.141 

.997 

4 

.500  1 

.473  1 

.316 

1 

n  = 

5 

n  =  6 

0 

0  .379 

0 

.500 

0 

.602 

0 

0  .345- 

0  .402 

0 

.536 

1

.021  .621 

.010 

.657 

.002 

.778 

1 

.017  .542 

.009  .598 

.002 

.706 

2 

.112  .753 

.076 

.811 

.033 

.894 

2 

.093  .667

.063  .729 

.027 

.827 

3 

.247  .888 

.189 

.924 

.106 

.967 

3 

.201  .799 

.153  .847 

.085- 

.915+ 

4 

.379  .979 

.343 

.990 

.222 

.998 

4 

.333  .907 

.271  .937 

.173 

.973 

5 

.621  1 

.500 

1 

.398 

1 

5 

.458  .983 

.402  .991 

.294 

.998 

6 

.655+  1 

.598  1 

.464 

1 

n = 7

n = 8

0 

0  .316 

0 

.377

0 

.500 

0 

0  .255- 

0  .315+ 

0 

.451 

1 

.015-  .500 

.007 

.554 

.001 

.643 

1 

.013  .418 

.006  .500 

.001 

.590 

2 

.079  .684 

.053 

.659 

.023 

.764 

2 

.069  .582 

.046  .685- 

.020 

.707 

3 

.170  .721 

.129 

.775- 

.071 

.858 

3 

.147  .745+ 

.111  .711 

.061 

.802 

4 

.279  .830 

.225+ 

.871 

.142 

.929 

4 

.240  .760 

.193  .807 

.121 

.879 

5 

.316  .921 

.341 

.947 

.236 

.977 

5 

.255-  .853 

.289  .889 

.198 

.939

6

.500  .985+ 

.446 

.993 

.357 

.999 

6 

.418  .931 

.315+  .954 

.293 

.980 

7 

.684  1 

.623 

1 

.500 

1 

7 

.582  .987 

.500  .994 

.410 

.999 

8 

.745+  1 

.685-  1 

.549 

1 

n = 9

n = 10

0 

0  .232 

0 

.289 

0 

.402 

0 

0  .222 

0  .267 

0 

.376 

1 

.012  .391 

.006 

.443 

.001 

.598 

1 

.010  .352 

.005+  .397 

.001 

.512 

2 

.061  .515+ 

.041 

.558 

.017 

.656 

2 

.055-  .500 

.037  .603 

.016 

.624 

3 

.129  .610

.098 

.711 

.053 

.750 

3 

.116  .648 

.087  .619 

.048 

.703 

4 

.210  .768 

.169 

.749 

.105+ 

.829 

4 

.188  .659 

.150  .733 

.093 

.782 

5 

.232  .790 

.251 

.831 

.171 

.895-

5 

.222  .778 

.222  .778 

.150 

.850 

6 

.390  .871 

.289 

.902 

.250 

.947 

6 

.341  .812 

.267  .850 

.218 

.907 

7 

.485-  .939 

.442 

.959 

.344 

.983 

7 

.352  .884 

.381  .913 

.297 

.952 

8 

.609  .988 

.557 

.994 

.402 

.999 

8 

.500  .945+ 

.397  .963 

.376 

.984 

9 

.768  1 

.711 

1 

.598 

1 

9 

.648  .990 

.603  .995- 

.488 

.999 

10 

.778  1 

.733  1 

.624 

1 

n  = 

11 

n  =  12 

0 

0  .197 

0 

.250 

0 

.359 

0 

0  .184 

0  .236 

0 

.321 

1 

.010  .315+ 

.005- 

.369 

.001 

.500 

1 

.009  .294 

.004  .346 

.001 

.445+ 

2 

.049  .423 

.033 

.500 

.014 

.598 

2 

.045+  .398 

.030  .450 

.013 

.555- 

3 

.105-  .577 

.079 

.631 

.043 

.660 

3 

.096  .500 

.072  .550 

.039 

.679 

4 

.169  .685- 

.135+ 

.667 

.084 

.738 

4 

.154  .602 

.123  .654 

.076 

.698 

5 

.197  .698 

.200 

.750 

.134 

.806 

5 

.184  .706 

.181  .706 

.121 

.765+ 

* Use explained in Sec. 2.3.3. See Appendix Charts II, III, and IV for sample size n > 30.
This table was calculated by Edwin L. Crow, Eleanor G. Crow, and Robert S. Gardner,
NOTS, according to a modification of a proposal of Theodore E. Sterne. These limits are
usually tighter than those obtained by the method of Clopper and Pearson used for Charts
II-IV, but the difference is not large for n ≥ 30 (Edwin L. Crow, “Confidence Intervals for
a Proportion,” BIOMETRIKA, Vol. 43 (1956), pp. 423-35).
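For comparison with the tabled limits, the Clopper and Pearson limits mentioned in the footnote can be computed directly from the binomial distribution. The sketch below uses the usual equal-tail construction (probability (1 − confidence)/2 in each tail) solved by bisection; it is not the modified Sterne procedure used to build Table 21, and the function names are ours.

```python
from math import comb


def binom_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X binomial with parameters n and p."""
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k + 1))


def _root_increasing(f, lo: float = 0.0, hi: float = 1.0, iters: int = 60) -> float:
    """Bisection for an increasing function f with f(lo) < 0 < f(hi)."""
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def clopper_pearson(r: int, n: int, conf: float = 0.95) -> tuple[float, float]:
    """Equal-tail two-sided limits for a proportion with r successes in n trials."""
    tail = (1.0 - conf) / 2.0
    # Lower limit: p such that P(X >= r) equals the tail probability.
    lower = 0.0 if r == 0 else _root_increasing(
        lambda p: (1.0 - binom_cdf(r - 1, n, p)) - tail)
    # Upper limit: p such that P(X <= r) equals the tail probability.
    upper = 1.0 if r == n else _root_increasing(
        lambda p: tail - binom_cdf(r, n, p))
    return lower, upper


if __name__ == "__main__":
    lo, hi = clopper_pearson(0, 10, 0.95)
    print(f"r = 0, n = 10, 95%: {lo:.3f} to {hi:.3f}")
```

For r = 0 and n = 10 at 95% this construction gives an upper limit of about .308, against the tabled .267, which illustrates the footnote's remark that the tabled limits are usually tighter.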
TABLE 21 (Contd.)


(a)  Two-Sided  Limits  (Contd.) 


r 

90% 

95% 

99% 

r 

90% 

95% 

99% 

n = 11

n = 12

6 

.302 

.803 

.250 

.800 

.194 

.866 

6 

.271 

.729 

.236 

.764 

.175- 

.825+ 

7 

.315+

.831 

.333 

.865- 

.262 

.916 

7 

.294 

.816 

.294 

.819 

.235- 

.879 

8 

.423 

.895+ 

.369 

.921 

.340 

.957 

8 

.398 

.846 

.346 

.877 

.302 

.924 

9 

.577 

.951 

.500 

.967 

.402

.986 

9 

.500 

.904 

.450 

.928 

.321 

.961 

10 

.685- 

.990 

.631 

.995+ 

.500 

.999 

10 

.602 

.955- 

.550 

.970 

.445+ 

.987 

11 

.803 

1 

.750 

1 

.641 

1 

11 

.706 

.991 

.654 

.996 

.555- 

.999 

12 

.816 

1 

.764 

1 

.679 

1 

n = 13

n = 14

0 

0 

.173 

0 

.225+ 

0 

.302 

0 

0 

.163 

0 

.207 

0 

.286 

1 

.008 

.276 

.004 

.327 

.001 

.429 

1 

.007 

.261 

.004 

.312 

.001 

.392 

2 

.042 

.379 

.028 

.434 

.012 

.523 

2 

.039 

.365+ 

.026 

.389 

.011 

.500 

3 

.088 

.470 

.066 

.520 

.036 

.594 

3 

.081 

.422 

.061 

.500 

.033 

.608 

4 

.142 

.545- 

.113 

.587 

.069 

.698 

4 

.131 

.578 

.104 

.611 

.064 

.636 

5 

.173 

.621 

.166 

.673 

.111 

.727 

5 

.163 

.594 

.153 

.629 

.102 

.714 

6 

.246 

.724 

.224 

.740 

.159 

.787 

6 

.224 

.645+ 

.206 

.688 

.146 

.751 

7 

.276 

.754 

.260 

.776 

.213 

.841 

7 

.261 

.739 

.207 

.793 

.195- 

.805+ 

8 

.379 

.827 

.327 

.834 

.273 

.889 

8 

.355- 

.776 

.312 

.794 

.249 

.854 

9 

.455+ 

.858 

.413 

.887 

.302 

.931 

9 

.406 

.837 

.371 

.847 

.286 

.898 

10 

.530 

.912 

.480 

.934 

.406 

.964 

10 

.422 

.869 

.389 

.896 

.364 

.936 

11 

.621 

.958

.566 

.972 

.477 

.988 

11 

.578 

.919 

.500 

.939 

.392 

.967 

12 

.724 

.992 

.673 

.996 

.571 

.999 

12 

.635- 

.961 

.611 

.974 

.500 

.989 

13 

.827 

1 

.775- 

1 

.698 

1 

13 

.739 

.993 

.688 

.996 

.608 

.999 

14 

.837 

1 

.793 

1 

.714 

1 

n  = 

15 

n  =  16 

0 

0 

.154 

0 

.191 

0 

.273 

0 

0 

.147 

0 

.179 

0 

.264 

1 

.007 

.247 

.003 

.302 

.001 

.373 

1 

.007 

.235+ 

.003 

.273 

.001 

.357 

2 

.036 

.326 

.024 

.369 

.010 

.461 

2 

.034 

.305+

.023 

.352 

.010 

.451 

3 

.076 

.400 

.057 

.448 

.031 

.539 

3 

.071 

.381 

.053 

.429 

.029 

.525- 

4 

.122 

.500 

.097 

.552 

.059 

.627 

4 

.114 

.450 

.090 

.500 

.055+ 

.579 

5 

.154 

.600 

.142 

.631 

.094 

.672 

5 

.147 

.550 

.132 

.571 

.088 

.643 

6 

.205+ 

.674 

.191 

.668 

.135- 

.727 

6 

.189 

.619 

.178 

.648 

.125+ 

.705- 

7 

.247 

.675- 

.192 

.706 

.179 

.771 

7 

.235+ 

.695- 

.179 

.727 

.166 

.739 

8 

.325+ 

.753 

.294 

.808 

.229 

.821 

8 

.299 

.701 

.272 

.728 

.212 

.788 

9 

.326 

.795- 

.332 

.809 

.273 

.865+ 

9 

.305+ 

.765- 

.273 

.821 

.261 

.834 

10 

.400 

.846 

.369 

.858 

.328 

.906

10 

.381 

.811 

.352 

.822 

.295+ 

.875- 

11 

.500 

.878 

.448 

.903 

.373 

.941 

11 

.450 

.853 

.429 

.868 

.357 

.912 

12 

.600 

.924 

.552 

.943 

.461 

.969 

12 

.550 

.886 

.500 

.910 

.421 

.945- 

13 

.674 

.964 

.631 

.976 

.539

.990

13 

.619 

.929 

.571 

.947 

.475+ 

.971 

14 

.753 

.993 

.698 

.997 

.627 

.999 

14 

.695- 

.966 

.648 

.977 

.549 

.990 

15 

.846 

1 

.809 

1 

.727 

1 

15 

.765- 

.993 

.727 

.997 

.643 

.999 

16 

.853 

1 

.821 

1 

.736 

1 

n = 17

n = 18

0 

0 

.140 

0 

.167 

0 

.243 

0 

0 

.135- 

0 

.157 

0 

.228 

1 

.006 

.225+ 

.003 

.254 

.001 

.346 

1 

.006 

.216 

.003 

.242 

.001 

.318 

2 

.032 

.290 

.021 

.337 

.009 

.413 

2 

.030 

.277 

.020 

.325-

.008 

.397 

3 

.067 

.364 

.050 

.417 

.027 

.500 

3 

.063 

.349 

.047 

.381 

.025+ 

.466 

4 

.107 

.432 

.085- 

.489 

.052 

.587 

4 

.101 

.419 

.080 

.444 

.049 

.534 

5 

.140 

.500 

.124 

.544 

.082 

.620 

5 

.135- 

.482 

.116 

.556 

.077 

.603

6 

.175+ 

.568 

.166 

.594 

.117 

.662 

6 

.163 

.536 

.156 

.619 

.110 

.682 

7 

.225+ 

.636 

.167 

.663 

.155+ 

.757 

7 

.216 

.584 

.157 

.625+ 

.145+ 

.686 

8 

.277 

.710 

.253 

.746 

.197 

.758 

8 

.257 

.651 

.236 

.675+ 

.184 

.772 

9 

.290 

.723 

.254 

.747 

.242 

.803 

9 

.277 

.723 

.242 

.758 

.226 

.774 

10 

.364 

.775- 

.337 

.833 

.243 

.845- 

10 

.349 

.743 

.325- 

.764 

.228 

.816 

TABLE 21 (Contd.)


(a)  Two-Sided  Limits  (Contd.) 


r 

90% 

95% 

99% 

r 

90% 

95% 

99% 

n  = 

17 

n  =  18 

11 

.432 

.825- 

.406 

.834 

.338 

.883 

11 

.416 

.784 

.375- 

.843 

.314 

.855- 

12 

.500 

.860 

.456 

.876 

.380 

.918 

12 

.464 

.837 

.381 

.844 

.318 

.890 

13 

.568 

.893 

.511 

.915+ 

.413 

.948 

13 

.518 

.865+ 

.444 

.884 

.397 

.923 

14 

.636 

.933 

.583 

.950 

.500 

.973 

14 

.581 

.899 

.556 

.920 

.466 

.951 

15 

.710 

.968 

.663 

.979 

.587 

.991 

15 

.651 

.937 

.619 

.953 

.534 

.975- 

16 

.775- 

.994 

.746 

.997 

.654 

.999 

16 

.723 

.970 

.675+ 

.980 

.603 

.992 

17 

.860 

1 

.833 

1 

.757 

1 

17 

.784 

.994 

.758 

.997 

.682 

.999 

18 

.865+ 

1 

.843 

1 

.772 

1 

n = 19

n = 20

0 

0 

.130 

0 

.150 

0 

.218 

0 

0 

.126 

0 

.143 

0 

.209 

1 

.006 

.209 

.003 

.232 

.001 

.305+ 

1 

.005+ 

.203 

.003 

.222 

.001 

.293 

2 

.028 

.265+ 

.019 

.316 

.008 

.383 

2 

.027 

.255- 

.018 

.294 

.008 

.375- 

3 

.059 

.337

.044 

.365- 

.024 

.455+ 

3 

.056 

.328 

.042 

.351 

.023 

.424 

4 

.095+ 

.387 

.075+ 

.426 

.046 

.515+ 

4 

.090 

.367 

.071 

.411 

.044 

.500 

5 

.130 

.440 

.110 

.500 

.073 

.564 

5 

.126 

.422 

.104 

.467 

.069 

.576 

6 

.151 

.560 

.147 

.574 

.103 

.617 

6 

.141 

.500 

.140 

.533 

.098 

.601 

7 

.209

.613 

.150 

.635+ 

.137 

.695- 

7 

.201 

.578 

.143 

.589 

.129 

.637 

8 

.238 

.614 

.222 

.655+ 

.173 

.707 

8 

.221 

.633 

.209 

.649 

.163 

.707 

9 

.265+ 

.663 

.232 

.688 

.212 

.782 

9 

.255- 

.642 

.222 

.706 

.200 

.726 

10 

.337 

.735- 

.312 

.768 

.218 

.788 

10 

.325- 

.675+ 

.293 

.707 

.209 

.791 

11 

.386 

.762 

.345- 

.778 

.293 

.827 

11 

.358 

.745+ 

.294 

.778 

.274 

.800 

12 

.387 

.791 

.365-

.850 

.305+ 

.863 

12 

.367 

.779 

.351 

.791 

.293 

.837

13 

.440 

.849 

.426 

.853 

.383 

.897 

13 

.422 

.799 

.411 

.857 

.363 

.871 

14 

.560 

.870 

.500 

.890 

.436 

.927 

14 

.500 

.859 

.467 

.860 

.399 

.902 

15 

.613 

.905- 

.574 

.925- 

.485- 

.954 

15 

.578 

.874 

.533 

.896 

.424 

.931 

16 

.663 

.941 

.635+

.956 

.545- 

.976 

16 

.633 

.910 

.589 

.929 

.500 

.956 

17 

.735- 

.972 

.684 

.981 

.617 

.992 

17 

.672 

.944 

.649 

.958

.576 

.977 

18 

.791 

.994 

.768 

.997 

.695- 

.999 

18 

.745+ 

.973 

.706 

.982 

.625+ 

.992 

19 

.870 

1 

.850 

1 

.782 

1 

19 

.797 

.995- 

.778 

.997 

.707 

.999 

20 

.874 

1 

.857 

1 

.791 

1 

n = 21

n = 22

0 

0 

.123 

0 

.137 

0 

.201 

0 

0 

.116 

0 

.132 

0 

.194 

1 

.005+ 

.192 

.002 

.213 

.000 

.283 

1 

.005- 

.182 

.002 

.205+ 

.000 

.273 

2 

.026 

.245- 

.017 

.277 

.007 

.347 

2 

.024 

.236 

.016 

.264 

.007 

.334 

3 

.054 

.307 

.040 

.338 

.022 

.409 

3 

.051 

.289 

.038 

.326 

.021 

.396 

4 

.088 

.353 

.068 

.398 

.041 

.466 

4 

.082 

.340 

.065- 

.389 

.039 

.454 

5 

.121 

.407 

.099 

.455+ 

.065+ 

.534 

5 

.115- 

.393 

.094

.424 

.062 

.505- 

6 

.130 

.458 

.132 

.506 

.092 

.591 

6 

.116 

.444 

.126 

.500 

.088 

.550 

7 

.191 

.542 

.137 

.551 

.122 

.653 

7 

.181 

.500 

.132 

.576 

.116 

.604 

8 

.192 

.593 

.197 

.602 

.155- 

.661 

8 

.182 

.556 

.187 

.582 

.147 

.666 

9 

.245- 

.647 

.213 

.662 

.189 

.717 

9 

.236 

.607 

.205+ 

.617 

.179 

.682 

10 

.306 

.693 

.276 

.723 

.201 

.743 

10 

.289 

.660 

.260 

.674 

.194 

.727 

11 

.307 

.694 

.277 

.724 

.257 

.799 

11 

.290 

.710 

.264 

.736 

.242 

.758 

12 

.353 

.755+ 

.338 

.787 

.283 

.811 

12 

.340 

.711 

.326 

.740 

.273 

.806 

13 

.407 

.808 

.398 

.803 

.339 

.845+

13 

.393 

.764 

.383 

.795- 

.318 

.821 

14 

.458 

.809 

.449 

.863 

.347 

.878 

14 

.444 

.818 

.418 

.813 

.334 

.853 

15 

.542 

.870 

.494 

.868 

.409 

.908 

15 

.500 

.819 

.424 

.868 

.396 

.884 

16 

.593 

.879 

.545- 

.901 

.466

.935- 

16 

.556 

.884 

.500 

.874 

.450 

.912 

17 

.647 

.914 

.602 

.932 

.534 

.959 

17 

.607 

.885+ 

.576 

.906 

.495+ 

.938 

18 

.693 

.946 

.662 

.960 

.591 

.978 

18 

.660 

.918 

.611 

.935+

.546 

.961 

19 

.755+ 

.974 

.723 

.983 

.653 

.993 

19 

.711 

.949 

.674 

.962 

.604 

.979 

20 

.808 

.995- 

.787 

.998 

.717 

1.000 

20 

.764 

.976 

.736 

.984 

.666 

.993 

21 

.877 

1 

.863 

1 

.799 

1 

21 

.818 

.995+ 

.795- 

.998 

.727 

1.000 

22 

.884 

1 

.868 

1 

.806 

1 



TABLE  21  (Contd.) 

(a)  Two-Sided  Limits  (Contd.) 


n = 23

0  0  .111  0  .127  0 

1  .005-  .174  .002  .198  .000 

2  .023  .228  .016  .255-  .007 

3  .049  .274  .037  .317  .020 

4  .078  .328  .062  .361  .038


6  .111 

7  .173 

8  .174 

9  .228 

10  .273 

11  .274

12  .328 

13  .381 

14  .431 

15  .478 

16  .521 

17  .569 

18  .619 

19  .672 

20  .726 

21  .772 

22  .826 

23  .889 


0  0  .102 

1  .004  .159 

2  .021  .214 

3  .045-  .255- 

4  .072  .307

5  .101  .362

6  .102  .390

7  .158  .432

8  .159  .500

9  .214  .568

10  .246  .610

11  .255-  .611

12  .307  .640 

13  .360  .693 

14  .389  .745+ 

15  .390  .754 

16  .432  .786 

17  .500  .841 

18  .568  .842 

19  .610  .898 

20  .638  .899 

21  .693  .928 

22  .745+  .955+ 

23  .786  .979 

24  .841  .996 

25  .898  1 


.409 

.059 

.500 

.457 

.084 

.571 

.548 

.111 

.580 

.591 

.140 

.616 

.639 

.171 

.677 

.640 

.187 

.702 

.888 

.229 

.735- 

.745+ 

.265+ 

.771 

.753 

.298 

.813 

.802 

.323 

.829 

.822 

.384 

.860 

.873 

.420 

.889 

.880 

.429 

.916 

.910 

.500 

.941 

.938 

.571 

.962 

.963 

.614 

.980 

.984 

.677 

.993 

.998 

.735- 

1.000 

1 

.813 

1 

25 

.118 

0 

.175+ 

.185+ 

.000 

.216 

.238 

.006 

.305- 

.303 

.018 

.352 

.336 

.034 

.403 

.384 

.054 

.451 

.431 

.077 

.500 

.475- 

.101 

.549 

.525+ 

.127 

.597 

.569 

.155+ 

.648 

.616 

.175+ 

.658 

.664 

.205+ 

.695+ 

.683 

.245+ 

.754 

.704 

.246 

.755- 

.762 

.305- 

.795- 

.778 

.342 

.825- 

.815- 

.352 

.845- 

.839 

.403 

.873 

.882 

.451 

.899 

.893 

.500 

.923 

.918 

.549 

.946 

.943 

.597 

.966 

n  —  24 

0  0  .105+  0  .122

1  .004  .165+  .002  .191 

2  .022  .221  .015+  .246 

3  .047  .264  .035-  .308 

4  .075-  .317  .059  .347 

5  .105-  .370  .086  .396

6  .105+  .423  .115*  .443

7  .165-  .448  .122  .500 

8  .165+  .552  .169  .557 

9  .221  .553  .191  .604 

10  .259  .587  .234  .653 


.697  .966 
.762  .986 
.815-  .998 
.882  1 


.754  1.000 
.825-  1 


11  .264 

12  .317 

13  .370 

14  .413 

15  .447 

16  .448

17  .552 

18  .577 

19  .630 

20  .683 

21  .736

22  .779 

23  .835-

24  .895- 


0  0 

1  .004 

2  .021 

3  .043 

4  .069 

5  .097 

6  .098 

7  .151 

8  .152 

9  .209 

10  .233 

11  .247

12  .299 

13  .342 

14  .343 

15  .377 

16  .419 

17  .460 

18  .540 

19  .581 

20  .623 

21  .657

22  .701 

23  .753 

24  .791 

25  .848 

26  .902 


.630  .246  .( 
.683  .308  .( 
.736  .339  .1 
.741  .347  .1 

.779  .396  A 

.885-  .443  A 
.835+  .500  A 
.895-  .557  A 
.89C'  604  -1 
.925+  .653  .f 

.953  .692  .* 
.978  .754  A 
.996  .809  .1 
1  .878 

n  =  26 


0  .181 
.000  .259 

.006  .313 

.019  .364

.036  .416 

.057  .464 

.080  .536 

.106  .584 

.133  .636 

.163  .688 

.181  .687 

.216  .720
.257  .743 

.280  .784 

.313  .819 

.362  .837 

.364  .867 

.416  .894 

.464  .920 

.536  .943 

.584  .964

.636  .981

.687  .994 

.741  1.000 
.819  1 


0  .114 

.002  .180 
.014  .230 
.032  .283 
.054  .325+
.079  .374 

.106  .421 
.114  .485- 
.154  .506 
.180  .542 
.212  .579 

.230  .626 
.282  .675- 
.283  .717 
.325+  .718 
.374  .770 

.421  .788 
.458  .820 
.494  .846 
.535-  .886 
.570  .894 

.626  .921 
.675-  .946 
.717  .968
.770  .986 
.820  .998 
.886  1 
TABLE 21 (Contd.)

(a)  Two-Sided  Limits  (Contd.) 


r 

90% 

95% 

99% 

r 

90% 

95% 

99%

n = 27

n = 28

0 

0 

.098 

0 

.110 

0 

.166 

0 

0 

.090 

0 

.106 

0 

.162 

1 

.004 

.146 

.002 

.175- 

.000 

.225- 

1 

.004 

.140 

.002 

.170 

.000 

.218 

2 

.020 

.204 

.013 

.223 

.006 

.297 

2 

.019 

.201 

.013 

.217 

.005+ 

.278 

3 

.042 

.239

.031 

.270 

.017 

.332 

3 

.040 

.232 

.030 

.259 

.016 

.328 

4 

.066 

.291 

.052 

.316 

.032 

.384 

4 

.064 

.284 

.050 

.307 

.031 

.365- 

5 

.093 

.327

.076 

.364 

.050 

.419 

5 

.089 

.312 

.073 

.357 

.048 

.408 

6 

.094 

.365+

.101 

.415- 

.070 

.461 

6 

.090 

.355-

.098 

.384 

.068 

.449 

7 

.145+

.407 

.110 

.437 

.093 

.539 

7 

.139 

.396 

.106 

.424 

.089 

.500 

8 

.146 

.447 

.148 

.500 

.117 

.581 

8 

.140 

.435+ 

.142 

.463 

.112 

.551 

9 

.204 

.500 

.175- 

.563 

.143 

.587 

9 

.197 

.473 

.170 

.537 

.137 

.592 

10 

.221 

.553 

.202 

.570 

.166 

.617 

10 

.208 

.527 

.192 

.576 

.162 

.635+ 

11 

.239

.593 

.223 

.598 

.185- 

.668 

11 

.232 

.565- 

.217 

.616 

.175+ 

.636 

12 

.291 

.635-

.269 

.636 

.224 

.702 

12 

.284 

.604 

.258 

.619 

.214 

.677 

13 

.326 

.673 

.270 

.684 

.225- 

.716 

13 

.310 

.645+

.259 

.645+ 

.218 

.727 

14 

.327 

.674 

.316 

.730

.284 

.775+ 

14 

.312 

.688 

.307 

.693 

.272 

.728 

15 

.365+ 

.709 

.364 

.731 

.298 

.776 

15 

.355- 

.690 

.355- 

.741 

.273 

.782 

16 

.407 

.761 

.402 

.777 

.332 

.815+ 

16 

.396 

.716 

.381 

.742 

.323 

.786 

17 

.447 

.779 

.430 

.798 

.383 

.834 

17 

.435+ 

.768 

.384 

.783 

.364 

.825- 

18 

.500 

.796 

.437 

.825+ 

.413 

.857 

18 

.473 

.792 

.424 

.808 

.365- 

.838 

19 

.553 

.854 

.500 

.852 

.419 

.883 

19 

.527 

.803 

.463 

.830 

.408 

.863 

20 

.593 

.855- 

.563 

.890 

.461 

.907 

20 

.565- 

.860 

.537 

.858 

.449 

.888 

21 

.635- 

.906 

.585+ 

.899 

.539 

.930 

21 

.604 

.861 

.576 

.894 

.500 

.911 

22 

.673 

.907 

.636 

.924 

.581 

.950 

22 

.645+ 

.910 

.616 

.902 

.551 

.932 

23 

.709 

.934 

.684 

.948 

.616 

.968 

23 

.688 

.911 

.643 

.927 

.592 

.952 

24 

.761 

.958 

.730 

.969 

.668 

.983 

24 

.716 

.936 

.693 

.950 

.635+ 

.969 

25 

.796 

.980 

.777 

.987 

.703 

.994 

25 

.768 

.960 

.741 

.970 

.677 

.984 

26 

.854 

.996 

.825+ 

.998 

.775+ 

1.000 

26 

.799 

.981 

.783 

.987 

.727 

.995- 

27 

.907 

1 

.890 

1 

.834 

1 

27 

.860 

.996 

.830 

.998 

.782 

1.000 

28 

.910 

1 

.894 

1 

.838 

1 

n = 29

n = 30

0 

0 

.087 

0 

.103 

0 

.160 

0 

0 

.084 

0 

.100 

0 

.152 

1 

.004 

.135- 

.002 

.166 

.000 

.211 

1 

.004 

.130 

.002 

.163 

.000 

.206 

2 

.018 

.190 

.012 

.211 

.005+ 

.263 

2 

.018 

.183 

.012 

.205+ 

.005+ 

.256 

3 

.039 

.225- 

.029 

.251 

.015+ 

.316 

3 

.037 

.219 

.028 

.244 

.015- 

.310

4 

.062 

.279 

.049 

.299 

.030 

.354 

4 

.059 

.266 

.047 

.292 

.028 

.345- 

5 

.086 

.303

.070 

.340 

.046 

.397 

5 

.083 

.295- 

.068 

.325-

.045- 

.388

6 

.087 

.345- 

.094 

.374 

.065+ 

.438 

6 

.084 

.336 

.091 

.364 

.063 

.430 

7 

.134 

.385+ 

.103 

.413 

.086 

.477 

7 

.129 

.376 

.100 

.403 

.083 

.469 

8 

.135- 

.425- 

.136 

.451 

.108 

.523 

8 

.130 

.416 

.131 

.440 

.104 

.505+ 

9 

.189 

.468 

.166 

.500 

.132 

.562 

9 

.182 

.455+ 

.163 

.476 

.127 

.538 

10 

.190 

.500 

.184 

.549 

.157 

.603 

10 

.183 

.492 

.175+ 

.524 

.151 

.570 

11 

.225 

.537 

.211 

.537 

.165+ 

.646 

11 

.219 

.524 

.205+ 

.560 

.152 

.612 

12 

.276 

.575+ 

.247 

.626 

.206 

.654 

12 

.265- 

.554 

.236 

.597 

.198 

.655+ 

13 

.294 

.615- 

.251 

.660 

.211 

.684

13 

.266 

.584 

.244 

.636

.206 

.671 

14 

.303 

.655+ 

.299 

.661 

.260 

.737 

14 

.295- 

.624 

.292 

.675+ 

.249 

.692 

15 

.345- 

.697 

.339 

.701 

.263 

.740 

15 

.338 

.664 

.324 

.676 

.256 

.744 

16 

.385+ 

.706 

.340 

.749 

.316 

.789 

16 

.376 

.705+ 

.325- 

.708 

.308 

.751 

17 

.425- 

.724 

.374 

.753 

.346 

.794 

17 

.416 

.734 

.364 

.756 

.329 

.794 

18 

.463 

.775+ 

.413 

.789 

.354 

.835- 

18 

.446 

.735+ 

.403 

.764 

.345- 

.802 

19 

.500 

.810 

.451 

.816 

.397 

.843 

19 

.476 

.781 

.440 

.795- 

.388 

.848 

20 

.537 

.811 

.500 

.834 

.438 

.868 

20 

.508 

.817 

.476 

.825- 

.430 

.849 

21 

.575+ 

.865+ 

.549 

.864 

.477 

.892 

21 

.545- 

.818 

.524 

.837

.462 

.878 

22 

.615- 

.866 

.587 

.897 

.523

.914 

22 

.584 

.870 

.560 

.869 

.495- 

.896 

23 

.655+ 

.913

.626 

.906 

.562 

.935- 

23 

.624 

.871 

.597 

.900 

.531 

.917 

24 

.697 

.914 

.660 

.930 

.603 

.954 

24 

.664 

.916 

.636 

.909 

.570 

.937 

25 

.721 

.938

.701 

.951 

.646 

.970 

25 

.705+ 

.917 

.675+ 

.932 

.612 

.955+ 

26 

.775+ 

.961 

.749 

.971 

.684 

.985- 

26 

.734 

.941 

.708 

.953 

.655+ 

.972 

27 

.810 

.982 

.789 

.988 

.737 

.995- 

27 

.781 

.963 

.756 

.972 

.690 

.985+ 

28 

.865+ 

.996 

.834 

.998 

.789 

1.000 

28 

.817 

.982 

.795- 

.988 

.744 

.995- 

29 

.913 

1 

.897

1 

.840 

1 

29 

.870 

.996 

.837 

.998 

.794 

1.000 

30 

.916 

1 

.900 

1 

.848 

1 

TABLE 21 (Contd.)

(b)  One-Sided  Limits* 


If the observed proportion is r/n, enter the table with n and r for an upper one-sided
limit. For a lower one-sided limit, enter the table with n and n − r and
subtract the table entry from 1.
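The rule above can be sketched in code for sample sizes or confidence levels that are not tabled. The sketch assumes that the tabled upper limit is the value of p at which the probability of observing r or fewer successes equals 1 minus the confidence level; that assumption reproduces spot-checked entries (for example .684 for n = 2, r = 0 at 90%), but the function names are ours and the Manual does not give this procedure explicitly.

```python
from math import comb


def binom_cdf(r: int, n: int, p: float) -> float:
    """P(X <= r) for X binomial with parameters n and p."""
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(r + 1))


def upper_one_sided(n: int, r: int, conf: float) -> float:
    """Upper one-sided limit: the p with P(X <= r | p) = 1 - conf."""
    if r >= n:
        return 1.0  # every sample member has the attribute, so the limit is 1
    alpha = 1.0 - conf
    lo, hi = 0.0, 1.0
    for _ in range(60):  # bisection; the CDF decreases as p increases
        mid = (lo + hi) / 2.0
        if binom_cdf(r, n, mid) > alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def lower_one_sided(n: int, r: int, conf: float) -> float:
    """Lower one-sided limit by the table rule: enter with n - r, subtract from 1."""
    return 1.0 - upper_one_sided(n, n - r, conf)


if __name__ == "__main__":
    print(f"upper, n = 10, r = 2, 95%: {upper_one_sided(10, 2, 0.95):.3f}")
    print(f"lower, n = 10, r = 2, 95%: {lower_one_sided(10, 2, 0.95):.3f}")
```

As a table-only illustration of the same rule: for r = 2 in n = 10 at 95%, the upper limit is read directly as .507, and the lower limit is 1 − .963 = .037, where .963 is the entry for n = 10 and r = 8.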


r

90% 

95% 

99% 

r 

90% 

95% 

99% 

r 

90% 

95% 

99% 

n  =  2 

n  =  3 

n  —  4 

0 

.684 

.776 

.900 

0 

.536 

.632

.785- 

0 

.438 

.527 

.684 

1 

.949 

.975- 

.995- 

1 

.804 

.865- 

.941 

1 

.680 

.751 

.859 

2 

.965+ 

.983 

.997 

2 

.857 

.902 

.958 

3 

.974 

.987 

.997 

n = 5

n = 6

n = 7

0 

.369 

.451 

.602 

0 

.319 

.393 

.536 

0 

.280 

.348 

.482 

1 

.584 

.657 

.778 

1 

.510 

.582 

.700 

1 

.453 

.521 

.643 

2 

.753 

.811 

.894 

2 

.667 

.729 

.827 

2 

.596 

.659 

.764 

3 

.888 

.924 

.967 

3 

.799 

.847 

.915+ 

3 

.721 

.775- 

.858 

4 

.979 

.990 

.998 

4 

.907 

.937

.973 

4 

.830 

.871 

.929 

5 

.983 

.991 

.998 

5 

.921 

.947 

.977 

6 

.985+ 

.993 

.999 

n = 8

n = 9

n = 10

0 

.250 

.312 

.438 

0 

.226 

.283 

.401 

0 

.206 

.259 

.369 

1 

.406 

.471 

.590 

1 

.368 

.429 

.544 

1 

.337 

.394 

.504 

2 

.538 

.600 

.707 

2 

.490 

.550 

.656 

2 

.450 

.507 

.612 

3 

.655+

.711 

.802 

3 

.599 

.655+ 

.750 

3 

.552 

.607 

.703 

4 

.760 

.807 

.879 

4 

.699 

.749 

.829 

4 

.646 

.696 

.782 

5 

.853 

.889 

.939 

5 

.790 

.831 

.895- 

5 

.733 

.778 

.850 

6 

.931 

.954 

.980 

6 

.871 

.902 

.947 

6 

.812 

.850 

.907 

7 

.987 

.994 

.999 

7 

.939 

.959 

.983 

7 

.884 

.913 

.952 

8 

.988 

.994 

.999 

8 

.945+ 

.963 

.984 

9 

.990 

.995- 

.999 

n  =  11 

n  =  12 

n  =  13 

0 

.189 

.238

.342 

0 

.175- 

.221 

.319 

0 

.162 

.206 

.298 

1 

.310 

.364 

.470 

1 

.287 

.339 

.440 

1 

.268 

.316 

.413 

2 

.415+ 

.470 

.572 

2 

.386 

.438 

.537 

2 

.360 

.410 

.506 

3 

.511 

.564 

.660 

3 

.475+ 

.527 

.622 

3 

.444 

.495- 

.588 

4 

.599 

.650 

.738 

4 

.559 

.609 

.638 

4 

.523 

.573 

.661 

5 

.682 

.729 

.806 

5 

.638 

.685- 

.765+ 

5 

.598 

.645+ 

.727 

6 

.759 

.800 

.866 

6 

.712 

.755- 

.825+ 

6 

.689 

.713 

.787 

7 

.831 

.865- 

.916 

7 

.781 

.819 

.879 

7 

.736 

.776 

.841 

8 

.895+ 

.921 

.957 

8 

.846 

.877 

.924 

8 

.799 

.834 

.889 

9 

.951 

.967

.986 

9 

.904 

.928 

.961 

9 

.858 

.887 

.931 

10 

.990 

.995+ 

.999 

10 

.955- 

.970 

.987 

10 

.912 

.934 

.964 

11 

.991 

.996 

.999 

11 

.958 

.972 

.988 

12 

.992 

.996 

.999 

n = 14

n = 15

n = 16

0 

.152 

.193 

.280 

0 

.142 

.181 

.264 

0 

.134 

.171 

.250 

1 

.251 

.297 

.389 

1 

.236 

.279 

.368 

1 

.222 

.264 

.349 

2 

.337 

.385+ 

.478 

2 

.317 

.363 

.453 

2 

.300 

.344 

.430 

3 

.417 

.466 

.557 

3 

.393 

.440 

.529 

3 

.371 

.417 

.503 

4 

.492 

.540 

.627 

4 

.464 

.511 

.597 

4 

.439 

.484 

.569 

5 

.563 

.610 

.692 

5 

.532 

.577 

.660 

5 

.504 

.548 

.630 

* Use explained in Sec. 2.3.3. See Sec. 2.3.3 for sample size n > 30. This table was computed
by Robert S. Gardner, NOTS.
TABLE 21 (Contd.)


(b)  One-Sided  Limits  (Contd.) 


r 

90% 

95% 

99% 

r

90% 

95% 

99% 

r 

90% 

95% 

99% 

n = 23

n = 24

n = 25

0 

.095+ 

.122 

.181 

0 

.091 

.117 

.175- 

0 

.088 

.113 

.168 

1 

.159 

.190 

.256 

1 

.153 

.183 

.246 

1 

.147 

.176 

.237 

2 

.215+ 

.249 

.318 

2 

.207 

.240 

.307 

2 

.199 

.231 

.296 

3 

.268 

.304 

.374 

3 

.258 

.292 

.361 

3 

.248 

.282 

.349 

4 

.318 

.355- 

.427 

4 

.306 

.342 

.412 

4 

.295* 

.330 

.398 

5 

.366 

.404 

.476 

5 

.352 

.389 

.460 

5 

.340 

.375+ 

.444 

6 

.413 

.451 

.522 

6 

.398 

.435- 

.505- 

6 

.383 

.420 

.488 

7 

.459 

.496 

.567 

7 

.442 

.479 

.548 

7 

.426 

.462 

.531 

8 

.503 

.540 

.609 

8 

.484 

.521 

.590 

8 

.467 

.504 

.571 

9 

.546 

.583 

.650 

9 

.526 

.563 

.630 

9 

.508 

.544 

.610 

10 

.589 

.625- 

.689 

10 

.567 

.603 

.668 

10 

.548 

.583 

.648 

11 

.630 

.665- 

.727 

11 

.608 

.642 

.705- 

11 

.587 

.621 

.684 

12 

.670 

.704 

.763 

12 

.647 

.681 

.740 

12 

.625- 

.659 

.719 

13 

.710 

.742 

.797 

13 

.685+ 

.718 

.774 

13 

.662 

.695- 

.752 

14 

.748 

.778 

.829 

14 

.723 

.754 

.806 

14 

.699 

.730 

.784 

15 

.786 

.814 

.860 

15 

.759 

.788 

.837 

15 

.735- 

.764 

.815+ 

16 

.822 

.848 

.889 

16 

.795+ 

.822 

.867 

16 

.770 

.798 

.845- 

17 

.857 

.880 

.916 

17 

.830 

.854 

.894 

17 

.804 

.830 

.873 

18 

.890 

.910 

.941 

18 

.863 

.885+

.920 

18 

.837 

.861 

.899 

19 

.922 

.938 

.962 

19 

.895+ 

.914 

.943 

19 

.869 

.890 

.923 

20

.951 

.963 

.980 

20 

.925+ 

.941 

.964 

20 

.899 

.918 

.946 

21

.977 

.984 

.993 

21 

.953 

.965+ 

.981 

21 

.928 

.943 

.966 

22 

.995+ 

.998 

1.000 

22 

.978 

.985- 

.994 

22 

.955+ 

.966

.982 

23 

.996 

.998 

1.000 

23 

.979 

.986 

.994 

24 

.996 

.998 

1.000 

n = 26

n = 27

n = 28

0 

.085- 

.109 

.162 

0 

.082 

.105+ 

.157 

0 

.079 

.101 

.152 

1 

.142 

.170 

.229 

1 

.137 

.164 

.222 

1 

.132 

.159 

.215-

2 

.192 

.223 

.286 

2 

.185+ 

.215+ 

.277 

2 

.179 

.208 

.268 

3 

.239 

.272 

.337 

3 

.231 

.263 

.326 

3 

.223 

.254 

.316 

4 

.284 

.318 

.385- 

4 

.275- 

.308 

.373 

4 

.265+ 

.298 

.361 

5 

.328 

.363 

.430 

5 

.317 

.351 

.417 

5 

.306 

.339 

.404 

6 

.370 

.405+ 

.473 

6 

.358 

.392 

.458 

6 

.346 

.380 

.445- 

7 

.411 

.447 

.514 

7 

.397 

.432 

.498 

7 

.385- 

.419 

.484 

8 

.451 

.487 

.554 

8 

.436 

.471 

.537 

8 

.422 

.457 

.521 

9 

.491 

.526 

.592 

9 

.475- 

.509 

.574 

9 

.459 

.494

.558 

10 

.529 

.564 

.628 

10 

.512 

.547 

.610 

10 

.496 

.530 

.593 

11 

.567 

.602 

.664 

11 

.549 

.583 

.645+ 

11 

.532 

.565+

.627 

12 

.604 

.638 

.698 

12 

.585- 

.618 

.679 

12 

.567 

.600 

.660 

13 

.641 

.673 

.731 

13 

.620 

.653 

.711 

13 

.601 

.634 

.692 

14 

.676 

.708 

.763 

14 

.655+

.687 

.743 

14 

.635+ 

.667 

.723 

15 

.711 

.742 

.794 

15 

.689 

.720 

.773 

15 

.669 

.699

.753

16 

.746 

.774 

.823 

16 

.723 

.752 

.802 

16 

.701 

.731 

.782 

17 

.779 

.806 

.851 

17 

.756 

.783 

.831 

17 

.733 

.762 

.810 

18 

.812 

.837 

.878 

18 

.788 

.814 

.857 

18 

.765- 

.792 

.837 

19 

.843 

.866 

.903 

19 

.819 

.843 

.883 

19 

.796 

.821 

.863 

20 

.874 

.894 

.927 

20 

.849 

.871 

.907 

20 

.826 

.849 

.888 

21 

.903 

.921 

.948 

21 

.879 

.899 

.930 

21 

.855+ 

.876 

.911 

22 

.931 

.946 

.967 

22 

.907 

.924 

.950 

22 

.883 

.902 

.932 

23 

.957 

.968 

.983 

23 

.934 

.948 

.988 

23 

.911 

.927 

.952 

24 

.879 

.986 

.994 

24 

.958 

.939 

.083 

24 

.93e 

.950 

.969 

25 

.996 

.998 

1.000 

25 

.980 

.987 

.994 

25 

.960 

.970 

.984 

26 

.996 

.998 

1.000 

26 

.981 

.987 

.995- 

27 

.996 

.998 

1.000 

APPENDIX CHARTS

CHART I. Rectangular Normal Probability Chart*

P = 0.001 per rectangle. One quadrant of the chart is shown.


* Use explained in Sec. 1.2.1. This chart was constructed by A. D. Sprague, Bureau of
Ordnance, and is reproduced with the permission of R. S. Burington and A. D. Sprague.

CHART II. 90% Confidence Belts for Proportions*

[Chart: abscissa, observed proportion r/n, 0 to 1.0; ordinate, population proportion p.]

* Use explained in Sec. 2.3.3a. Table 21 should be used for sample size n ≤ 30. It gives
the interval explicitly for each sample size. This chart was adapted with permission from W.
J. Dixon and F. J. Massey, Jr., Introduction to Statistical Analysis, New York, McGraw-Hill,
1951, p. 320.


CHART III. 95% Confidence Belts for Proportions*

[Chart: ordinate, population proportion p.]

* Use explained in Sec. 2.3.3a. Table 21 should be used for sample size n < 30. It gives
the interval explicitly for each sample size. This chart was taken from C. J. Clopper and
E. S. Pearson, “The Use of Confidence or Fiducial Limits Illustrated in the Case of the
Binomial,” BIOMETRIKA, Vol. 26 (1934), pp. 404-13.


CHART IV. 99% Confidence Belts for Proportions*

[Chart: abscissa, observed proportion r/n, 0 to 1.0.]

* Use explained in Sec. 2.3.3a. Table 21 should be used for sample size n < 30. It gives
the interval explicitly for each sample size. This chart was taken from C. J. Clopper and
E. S. Pearson, “The Use of Confidence or Fiducial Limits Illustrated in the Case of the
Binomial,” BIOMETRIKA, Vol. 26 (1934), pp. 404-13.
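Charts III and IV are credited to Clopper and Pearson. As a rough illustration only, limits of the equal-tail kind described in that paper can be computed directly from the binomial distribution. The sketch below assumes that the belts are such equal-tail limits; the function names `binom_cdf` and `clopper_pearson` are illustrative and do not come from the Manual, and the sketch is not the method used to prepare Table 21.

```python
from math import comb


def binom_cdf(r, n, p):
    """Return P(X <= r) for X distributed binomially with parameters n, p."""
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(r + 1))


def clopper_pearson(r, n, confidence=0.95, tol=1e-10):
    """Equal-tail confidence limits for p, given r successes in n trials."""
    alpha = 1.0 - confidence

    def solve(r_cut, target):
        # binom_cdf(r_cut, n, p) falls steadily as p rises, so bisect on p.
        lo, hi = 0.0, 1.0
        while hi - lo > tol:
            mid = (lo + hi) / 2.0
            if binom_cdf(r_cut, n, mid) > target:
                lo = mid
            else:
                hi = mid
        return (lo + hi) / 2.0

    # Lower limit: P(X >= r) = alpha/2.  Upper limit: P(X <= r) = alpha/2.
    lower = 0.0 if r == 0 else solve(r - 1, 1.0 - alpha / 2.0)
    upper = 1.0 if r == n else solve(r, alpha / 2.0)
    return lower, upper


if __name__ == "__main__":
    # Hypothetical data: 12 successes observed in 40 trials.
    print(clopper_pearson(12, 40, confidence=0.95))
```

When r = 0 the upper limit has the closed form 1 - (α/2)^(1/n), which gives a convenient check on the bisection.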


CHART V. OC Curves for Testing the Hypothesis $\mu = a$ by the Equal-Tails Normal Test


(1946)


CHART VI. OC Curves for Testing the Hypothesis $\mu = a$ by the Equal-Tails $t$ Test*


CHART VII. OC Curves for Testing the Hypothesis $\sigma = \sigma_0$ Against $\sigma = \sigma_1 > \sigma_0$ by the $\chi^2$ Test


CHART VIII. OC Curves for Testing the Hypothesis $\sigma_1 = \sigma_2$ Against

CHART IX. Number of Degrees of Freedom Required To
Estimate the Standard Deviation Within p% of Its
True Value, With Prescribed Confidence*

[Chart: abscissa, p% (ticks at 5, 6, 8, 10, 20, 30, 40, 50); ordinate, degrees of freedom, f.]

* Use explained in Sec. 3.2.3a; calculate confidence intervals as in Sec. 3.2.3. Adapted,
with the permission of the American Statistical Association, from J. A. Greenwood and M. M.
Sandomire, “Sample Size Required for Estimating the Standard Deviation as a Percent of
Its True Value,” AM STAT ASSOC, J, Vol. 45 (1950), p. 258. (The manner of graphing
has been changed.)

CHART X. Number of Degrees of Freedom in Each
Sample Required To Estimate $\sigma_1/\sigma_2$ Within p% of
Its True Value, With Prescribed Confidence*

[Chart: abscissa, p% (ticks at 5, 6, 8, 10, 20, 30, 40, 50).]

* Use explained in Sec. 3.3.3a; calculate confidence intervals as in Sec. 3.3.3. This chart
was constructed by E. L. Crow and M. W. Maxfield, NOTS.


one in general, in order that the resulting reduced equation be a least-squares
estimate of the (reduced) population regression plane. If the experiment is
designed with such symmetry that the off-diagonal coefficients $a_{hj}$ ($h \neq j$)
of the normal equations are zero, following course 3 is equivalent to
following course 2 without drawing a new sample. Also, course 3 is sometimes
justified as an approximation to course 1 when, as in the example,
the differences between the $Y$'s as predicted by the resulting reduced
equation and by the equation using all the variables are negligible.

In any case the choice of equation is aided by comparison of the sizes
of the standard errors of estimate $s_{y|12\cdots k}$ of the various equations
considered. In an extreme case the regression as a whole may be significant
according to Sec. 6.2.3a, although no $b_i$ is significant individually. It may
happen in such a case that some subset of the original set of independent
variables yields practically as useful a regression as the original. This is
exemplified in Ref. 6.7, p. 646, together with a method of finding the
appropriate subset of independent variables.

Example. In the previous example, we found that two regression
coefficients, $b_1$ and $b_2$, were significant, but the third, $b_3$, was not
significant. What would the practical effect in estimating y be if we were to
discard the term in $x_3$?

Over the range of applicability of the regression, the maximum
error introduced by discarding the term $b_3(x_3 - \bar{x}_3)$ would be
$(0.0486)(10.05)(10^{-3}) = 0.00049$ inch. Therefore, we should follow
course 3 of the explanation above, discarding the term because its
contribution is not only not significant statistically but is also negligible
numerically.
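The arithmetic of this example is easily checked. In the sketch below the three factors are taken exactly as printed; the function name and the argument names are illustrative, and the reading of the first factor as the size of the coefficient, the second as the largest deviation of the variable from its mean, and the third as a scale factor is an assumption.

```python
def max_discard_error(coefficient, max_deviation, scale=1.0):
    """Largest change in the estimate of y caused by dropping one term.

    Assumed reading: the dropped term is coefficient * deviation * scale,
    so its largest size over the range is the product of the magnitudes.
    """
    return abs(coefficient) * abs(max_deviation) * scale


# Factors as printed in the example: (0.0486)(10.05)(10^-3), in inches.
error_inch = max_discard_error(0.0486, 10.05, 1e-3)
print(f"{error_inch:.5f} inch")
```

If the largest error so found is small beside the tolerance of interest in y, the term may be dropped on numerical grounds as well as statistical ones.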

f. Confidence Interval for Ordinate to the True Regression Plane.

Confidence limits for the true mean y value (not for an individual predicted
y) at the x point $(X_1, X_2, \ldots, X_k)$ are

$Y \pm t_{\alpha/2,\,n-k-1}\, s_{y|12\cdots k}$

where $Y$ is calculated from the regression equation. The x point need not
be one of those used in finding the regression plane. Note that there are $k^2$
terms in the summation, k squares plus two equal cross-products for each

pair (h, j).
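Limits of this kind may be computed as sketched below. The radical used is the usual textbook one, $\sqrt{1/n + \sum_h \sum_j c_{hj}(X_h - \bar{x}_h)(X_j - \bar{x}_j)}$, supplied here as an assumption; the symbol $c_{hj}$ (elements of the inverse of the matrix of normal-equation coefficients $a_{hj}$) and all names in the code are illustrative and are not taken from the Manual. The value of t is to be read from a t table and passed in.

```python
from math import sqrt


def mean_response_limits(y_hat, t_crit, s_est, n, c, x_point, x_means):
    """Confidence limits for the true mean y at a given x point.

    y_hat   : ordinate calculated from the regression equation
    t_crit  : tabled t value for alpha/2 and n - k - 1 degrees of freedom
    s_est   : standard error of estimate
    n       : number of observations
    c       : k-by-k inverse of the normal-equation coefficient matrix
              (assumed notation, not the Manual's)
    x_point : the k coordinates at which the ordinate is wanted
    x_means : the k sample means of the independent variables
    """
    d = [x - m for x, m in zip(x_point, x_means)]
    k = len(d)
    # k*k terms: k squares plus two equal cross-products for each pair (h, j).
    quad = sum(c[h][j] * d[h] * d[j] for h in range(k) for j in range(k))
    half_width = t_crit * s_est * sqrt(1.0 / n + quad)
    return y_hat - half_width, y_hat + half_width


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    c_inv = [[0.04, 0.0], [0.0, 0.01]]
    print(mean_response_limits(10.0, 2.0, 1.5, 25, c_inv, (4.0, 7.0), (3.0, 5.0)))
```

At the point of means the double sum vanishes and the half-width reduces to t times s divided by the square root of n, so the interval is narrowest there.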


CHART XI. 95% Confidence Belts for Correlation Coefficient*


* Use explained in Sec. 6.1.4b. The numbers on the curves indicate sample size for the
case of a two-variable linear regression. Use the curve labeled $n - k + 1$ to find confidence
intervals for the partial correlation coefficients in a multiple linear regression with a total
of $k + 1$ variables and $n$ observations. Reproduced, with the permission of E. S. Pearson,
from F. N. David, Tables of the Correlation Coefficient, London, Biometrika Office, 1938,
Chart II. Charts for 90, 98, and 99% confidence belts are given in the book.

INDEX


A 

a,  y  intercept  of  regression  line,  152 
$a_{hj}$, coefficients in normal equations for
multiple regression, 170
Absolute  value,  39 
Ac,  acceptance  number,  212 
Acceptable  quality  level,  AQL,  211 
Acceptance  number,  Ac,  212 
Acceptance  sampling,  209-26,  table,  255 
uses,  218,  220 
Accuracy,  67 

Additivity of means and variances, 59,
110, 119

Additivity of variances, 69, 110

$\alpha$, level of significance, 16, 41
American  Society  for  Quality  Control, 

208 

American  Society  for  Testing  Materials, 
205,  254 

American  Statistical  Association,  225 
Analysis  of  covariance,  111,  148 
Analysis  of  variance,  109-46 
assumptions,  119 
one-factor  analysis,  120 
multiple-factor  analysis,  139 
two-factor  analysis,  127 
Type  I  problems,  118 
Type  II  problems,  118 
Anderson,  R.  L.,  144,  251 
Anderson,  T.  W.,  106 
AQL,  acceptable  quality  level,  211 
Arkin,  H.,  193 

Asterisk  (*)  to  denote  significance,  16 
Attributes,  196,  209 
or  variables,  which  to  use  in 
acceptance  sampling,  210 
Average (mean), 10
Average  sample  number,  213 

B 

b ,  regression  coefficient  in  simple 

regression,  152 
confidence  intervals,  160 
tests  of  significance,  160 
bj,  partial  regression  coefficients,  169 
confidence  intervals,  179 
tests  of  significance,  179 


Background  conditions  in  an 
experiment,  109 
Baines,  A.  H.  J.,  250 
Bancroft,  T.  A.,  144 
Bartlett,  M.  S.,  106,  144 
Best  fitting  curve,  151,  153,  183 
$\beta$, risk of a Type II error, 16, 40
Bias,  12 

Binomial  coefficients,  30 
Binomial  distribution,  30 
approximation  by  Poisson 
distribution,  32 
example  of  use,  31 
references  to  tables,  35 
test  for  proportion,  48-50 
Bliss,  C.  I.,  106 
Bowker,  Albert  H.,  225,  253 
British  Standards  Institution,  208 
Brownlee,  K.  A.,  35 
Bruceton  method,  93 
Bureau  of  Ordnance,  106,  205,  225 
Burington, R. S., 29, 35, 267


C


$C_x^n$, $C(n, x)$, ${}_nC_x$, binomial
coefficients, 30
Cell,  133 

Center  of  gravity,  11 
Central  line,  196 
Central  location,  measures  of,  10 
$\chi^2$ distribution, table, 232
$\chi^2$ test

for  consistency  of  several  samples,  100 

for  goodness  of  fit,  85 

for  independence  in  a  contingency 


table,  97 

for  a  normal  distribution,  87 
for  standard  deviation,  70-72,  chart. 


Churchman,  C.  W.,  35,  64,  81,  145,  194 

Circular  probable  error,  29 

Clopper,  C.  J.,  271,  272 

Cochran,  W.  G.,  145 

Coding,  14,  68 

Colton,  R.  R.,  193 

Comparison of means, see Difference in
means, Trend in means, Analysis
of variance
Comparison of several samples, $\chi^2$ test
for consistency, 100
Comparison  of  two  samples  by  the  run 
test,  101 

Components  of  variance,  119,  123,  126, 
127,  132,  140,  144 
Confidence  band  for  cumulative 
distribution,  90,  table ,  248 
Confidence  coefficient,  18,  43 
Confidence  intervals,  18,  43 
approximate  method  for  constructing, 
18 

for  mean,  see  Mean,  confidence 
intervals  for;  for  standard 
deviation,  see  Standard  deviation, 
confidence  intervals  for;  etc. 
Confidence  limits,  18 
Consistency  test  for  several  samples,  100 
Construction of control charts, 199, 202
Consumer’s risk, 16, 218
Contingency tables, test for independence,
97

formula for 2-by-2 table, 100
Yates’s correction, 99
Continuous  distribution,  10 
Control  charts,  195-208,  table,  254 
Control  limits,  196,  199,  202 
Correlation,  147-94 
Correlation  coefficient,  157 
confidence  intervals,  159,  chart,  279 
meaning  in  Type  I  and  II  problems, 
158 

multiple,  175,  178 
partial,  176,  178 

test  of  significance,  159,  table,  241 
Costs 

considered  in  acceptance  sampling,  213 
considered  in  selecting  sample  size,  17 
Covariance,  analysis  of,  111,  148 
Cowden, D. J., 193
Cox, G. M., 145
Cramer, Harald, 106, 193
Crow, Edwin L., 29, 226, 257, 278
Crow, Eleanor G., 242, 257
Croxton,  F.  E.,  193 
Culling,  H.  P.,  107 
Cumulative  distribution  function,  8 
confidence  band,  90,  table,  248 
normal,  20,  tables ,  229-30 
test  by  confidence  band,  91 
Curve  fitting,  147-94 

D 

Data,  grouping  of,  5 
David,  F.  N.,  241,  279 


Defectives 

binomial  distribution  for  proportion, 
30,  48 

Poisson  distribution  for  random 
occurrence,  34 
Definitions,  3 
Degrees  of  freedom,  19 
for  error  estimate  in  a  design,  111 
$\delta^2$, mean square successive
difference, 63, table, 250
Density function, probability, 8
Department of Defense, 226
Designs for experiments, vi, 111
Destructive  sampling,  210 
Deviation,  12,  13 

Differences  among  several  means,  109-46 
Differences  among  several  standard 
deviations,  78 

Differences  between  two  means 
confidence  intervals,  55,  58 
test  of  significance,  52,  55,  57,  59 
Differences  between  two  proportions,  100 
Differences  (ratio)  between  two 

standard  deviations,  74-78,  chart , 
278 

Discrete distribution, 10
Dispersion, measures of, 12, 29
Distribution, 4, 7
binomial, 30
$\chi^2$, 232

circular  normal,  28 

confidence  band  for  cumulative,  90 

continuous,  10 

cumulative,  8 

discrete,  10 

F,  234 

frequency,  5 

normal,  19,  229,  230 

Poisson,  32 

population,  7 

sampling,  14 

$t$, 231

test  for  goodness  of  fit,  85 
two-dimensional,  27,  150 
Dixon,  W.  J.,  35,  64,  81,  106,  145,  194, 
243,  248,  252 
Dodge,  Harold  F.,  226 
Double  sampling,  212 

E 

Effect,  main,  133 

Eisenhart, C., 81, 107, 145, 205, 244, 253
Equal-tails test, 16
Error

circular  probable,  29 
probable,  13 
standard,  14 
types  of,  16 
Estimate 

of population mean, 11
of  population  proportion,  49 
of  population  standard  deviation,  12 
unbiased,  12 

Estimation,  vi,  see  Confidence  intervals 
by  percentiles,  13 
Experiment,  steps  in  designing  and 
running,  113 
Experimental  design,  111 
Ezekiel, M., 194


F 

F  distribution,  table,  234 
F  test 

equal-tails  test,  74 

in  analysis  of  variance,  118,  123,  130, 
134,  140 

in  regression  analysis,  177 
OC  curves,  chart,  276 
one-sided  test,  73 
Factorial  design,  114 
Factors  in  an  experiment,  109 
Fay,  Edward  A.,  226 
Ferris,  C.  D.,  273,  274,  275,  276 
Fiftieth  percentile,  11 
Finite  lot  or  population,  3ft  -ti,  >2 
Finney,  D.  J.,  106 

Fisher,  R.  A.,  107,  145,  230,  231,  251 
Fit,  test  for  goodness  of 
a  distribution,  110 
a  regression  curve,  165,  186,  188 
Fitting  a  Poisson  distribution,  34,  87 
a  curve,  183,  184,  185,  186 
a  line,  152 

a  normal  distribution,  88 
a  plane  or  hyperplane,  168 
a  surface,  183 
Fitting  data,  85 
Fojt,  Albin  A.,  187 
Fraction, $r^2$, of y variation

accounted for by regression, 157
Fraction, $r^2_{y|12\cdots k}$, of y variation
accounted for by multiple
regression, 175
Fractional  replication,  117 
Frankford  Arsenal,  35,  64,  81,  145,  194 
Freedom,  degrees  of,  19,  111 
Freeman,  H.  A.,  65,  226 
Frequency  diagram  or  histogram,  4 


Frequency  distribution 
population,  6 
relative,  5 
sample,  5 
Frequency  table,  5 
Friedman,  M.,  226 

G 

Gardner,  R.  S.,  252 
Goode,  Henry  P.,  225 
Goodness  of  fit,  85,  88 
of  a  distribution,  110 
of  a  regression  curve,  165,  186,  188 
Grant,  E.  L.,  205 
Greenwood,  J.  A.,  277 
Gross  errors,  102,  table,  252 
Grubbs,  F.  E.,  273,  274,  275,  276 

H 

Hald,  A.,  35,  146,  194 
Hartley,  H.  O.,  107 
Hastay,  M.  W.,  107,  205,  253 
Histogram,  5 

for discrete distribution, 10
Hit  probabilities,  25 
Hoel,  P.  G.,  81 

Homogeneity  of  variances,  78 
Hotelling,  H.,  205 
Houseman,  E.  E.,  251 
Hypothesis 
test  of,  15 
null,  15 

I 

Independence,  test  for,  in  a  contingency 
table,  97 

Yates’s  correction,  99 
Independent  events,  probability  that  both 
happen,  4,  6,  7 
Independent  samples,  3 
Independent  variables,  4 
Inspection 

100%,  198,  209 
by  attributes,  209 
by  variables,  209 
level,  213 
lot,  211 

Interaction,  109,  114,  133 
test against, 136

Interpolation,  harmonic,  51,  231,  235 
Intervals,  see  Confidence  intervals, 
Prediction  intervals,  Tolerance 
intervals 


J 

Joint  two-dimensional  distribution,  150 
Juran, J. M., 205

K 

K,  coefficient  for  two-sided  tolerance 

intervals,  104 

k,  coefficient  for  one-sided  tolerance 
intervals,  105 

ka ,  kr,  kt,  constants  for  variables 
acceptance  plan,  212 
Kempthorne,  O.,  146 
Kendall,  M.  G.,  35 
Kenney,  J.  F.,  194 

L 

L ,  lower  limit  in  variables  inspection, 

209 

$\lambda$, as abscissa for OC curve, 42, 46, 54,
58, 72, 77, charts, 273-76
$\lambda_j$, constant multipliers in orthogonal
polynomials, 186
Latin-square  design,  116 
Latscha,  R.,  107 
Least  squares,  151,  183 
Level of significance, 16
Levels  of  factors  in  an  experiment,  109 
Linear  combination,  variance  of,  69 
Linear  hypotheses,  testing,  111,  118 
Linear  regression,  147,  152 
multiple,  168 

simple  (two  variables),  152 
formulas  for  simple  regression,  164 
Linearity 

by  transformations,  169,  183,  184 
test  for,  165,  169,  186,  189 
Lord,  E.,  249 

Lot  tolerance  percent  defective,  LTPD, 
218 

LTPD,  lot  tolerance  percent  defective, 
218 

M 

M distribution, table, 240
M test for homogeneity of variance, 78
m, median, 11

m,  number  of  replications,  110,  111, 
120,  133 
Main  effect,  133 
Mann,  H.  B.,  146 

Massey,  F.  J.,  Jr.,  35,  64,  81,  106,  145, 
194,  243,  248 
Maxfield,  M.  W.,  278 


May, D. C., 35
McCarthy,  P.  J.,  106 
Mean,  10 

confidence  intervals,  43,  47 
importance  of,  37 
of  binomial  distribution,  31 
of  Poisson  distribution,  34 
population,  10 
sample,  10 

standard  error  of,  14,  69 
test  of  significance,  38,  44,  221,  charts , 
273-74 

Mean  deviation,  13 
Mean square successive difference, $\delta^2$,
63, table, 250

Means,  comparison  of,  see  Differences  in 
means,  Trend  in  means,  Analysis  of 
variance 

Measurement  error,  37,  226 
Measures  of  central  location,  10 
Measures  of  dispersion,  12,  29 
Median,  11 

Merrington, M., 81, 235, 240
Method  of  least  squares,  183 
Midrange,  12 
Milne,  W.  E.,  194 

Mixed  models  in  analysis  of  variance, 

118 
Mode,  12 
Molina, E. C., 35
Mood, A., 146
Mosteller, F., 226
$\mu$, population mean, 10
$\mu_1 - \mu_2$, see Difference between
two means

Multiple correlation coefficient
$r_{y|12\cdots k}$, 175
test  of  significance,  178 
Multiple-factor  analysis  of  variance,  139 
Multiple  linear  regression,  168 
Multiple  sampling,  212 
Mutually  exclusive  events,  7 


$n_\infty$, sample size for very large lot size,
215

$\binom{n}{x}$, binomial coefficient, 31

National  Bureau  of  Standards,  35,  65, 
226,  229 

National  Defense  Research  Committee, 
65 

Naval  Ordnance  Laboratory,  U.S.,  107 
Naval  Ordnance  Test  Station,  U.S.,  226, 
242,  252,  257,  278 




Negative  correlation,  157 
Non-linear  regression,  183 
Normal  distribution,  19 
as  an  assumption  for  a  test,  38 
cumulative,  20,  table ,  229 
density,  19 

examples  of  use  of  normal  tables,  21 
for  hits  on  a  target,  25 
percentiles  of,  230 
test  for  goodness  of  fit,  88 
uses,  24 

Normal  equations  (in  multiple 
regression),  170 
check  for  computation  of,  172 
check  for  solution  of,  171 
Normal  inspection,  214 
Normal  probability  chart,  29,  chart,  269 
Normal  test 

for  differences  in  means,  53 
for mean, $\mu$, 38
OC  curves,  chart,  273 
Normality 

checked  by  graphical  methods,  87 
test  for,  87,  88 

transformations  to  obtain,  88,  89 
Notation  for  control  charts,  198 
Null  hypothesis,  15 

O

OC, operating characteristic curve, 17,
charts, 273-76
example, 41

in acceptance sampling, 217
Occurrences  of  defects,  Poisson 
distribution  for,  34 
One-factor  analysis  of  variance,  120 
One-hundred  percent  inspection,  198, 209 
One-sided  test,  16 

One-variable  classification  in  analysis  of 
variance,  120 

Operating  characteristic  curve,  17, 
charts ,  273-76 
example,  41 

in  acceptance  sampling,  217 
Operating  ratio,  Ro,  218 
Ordinate  to  regression  line,  162 
Ordinate  to  regression  plane  (in  multiple 
regression),  181 

Orthogonal  polynomials,  186,  table,  251 
significance of jth-degree term, 188


P 

P%,  population  coverage  of  tolerance 
intervals,  18,  104 


Paired  observations 
designing  experiment  with,  52,  55 
normal  test  for  difference  in  means,  53 
sign  test,  56 
$t$ test, 57
Parameter,  vi,  12 
Partial  correlation  coefficients,  176 
confidence  intervals,  179 
tests  of  significance,  178 
Partial  regression  coefficients,  169 
confidence  intervals,  179 
tests  of  significance,  179 
PE,  probable  error,  13 
Peach,  P.,  208,  218,  226,  255 
Peak  value,  mode,  12 
Pearson,  E.  S.,  107,  208,  271,  272 
Percentage,  see  Proportion 
Percentage  frequency,  5 
Percentile, 11, 13

Planning  of  experiments,  17,  52,  55,  109, 
191 

Poisson distribution, 32, 34
Polynomial regression, 185
orthogonal polynomials, 186
significance of jth-degree term, 186
Pooled estimate of variance, $s_p^2$, 68
Population,  3 

coverage  (of  tolerance  intervals),  18, 
104 
mean,  10 
median,  11 
proportion,  49 
standard  deviation,  12 
variance,  12 
Precision,  67 

Prediction,  by  regression,  147-94 
interval,  163,  182 
Probability,  5 
chart ,  269 

density  function,  8,  19 
distribution,  7 
graph  paper,  87 
laws,  6 

of  Type  I  error,  16 
of Type II error, 16
Probable  error,  13 
circular,  29 

factor  for  converting  to  standard 
deviation  for  normal  distribution, 
13 

Probit  analysis,  93 
Producer’s  risk,  16 
Production  line,  195,  226 
Proportion, p (binomial distribution),
30, 48

confidence intervals, 51, 52, table, 257,
charts, 270-72
test for sample size n < 150, 49, 221
test for sample size $n > 5/p_0$, 50, 221
Proportions,  comparison  of,  100 

Q

Quality  control 
acceptance  sampling,  209-26 
analysis  of  variance,  109-46 
charts,  195-208 

consistency  of  proportions  in  several 
samples,  100 

construction  of  charts,  200,  203 
notation,  198 

run  test  for  randomness,  83 
tolerance  intervals,  18 
Quenouille,  M.  H.,  194 

R 

$R_0$, operating ratio, 218
$r$, correlation coefficient, 157, 158, table,
241, chart, 279

$r^2$, fraction of y variation accounted for
by regression, 175, table, 241
$r_{yj|12\cdots k}$, partial correlation
coefficient, 176

$r_{y|12\cdots k}$, multiple correlation
coefficient, 175

Radial  error,  root-mean-square,  29 
Radius  of  50%  circle,  29 
Random  numbers,  3 
Random  sample,  3 
Randomization of background conditions
in an experiment, 115
Randomized block design, 116
Randomness, run test for, 83, table, 244
Range, 13

correction factor to estimate standard
deviations, table, 248
Rao,  C.  R.,  146 

Ratio  of  two  standard  deviations 
chart,  278 

confidence  intervals,  77 
test  of  significance,  74,  75 
Rational  subgroups,  196 
Re,  rejection  number,  212 
Reduced  inspection,  214 
Regression analysis, 147-94
linear with two variables, 152
multiple  linear,  168 
non-linear,  183 
polynomial,  185 
Regression  coefficient,  152 
confidence  intervals,  160 


difference  between  two  regression 
coefficients,  161 
partial,  169,  179 
test  of  significance,  160 
Regression  curve  of  y  on  x,  148 
Regression  line,  148,  152 
of  x  on  y,  153 
variation  about,  156 
Regression  ordinate,  confidence 
interval  for,  162,  181 
Regression  plane,  148,  168 
variation  about,  174 
Rejection 
number,  Re,  212 
of  null  hypothesis,  39 
of outlying observation, 102
region  in  a  test,  39 
Relative  frequency,  5 
Reliability  of  regression  measures,  158 
in  multiple  regression,  177 
Replication,  110,  111,  120,  133 
Rider,  P.  R.,  194 
Risk 

$\alpha$, of Type I error, 16
$\beta$, of Type II error, 16
Romig,  Harry  G.,  226 
Run  test 

critical  values,  table,  244 
for  comparing  two  samples,  101 
for  randomness,  83 


S

s, sample standard deviation, 12
computation, 12, 68
computed from coded values, 68
$s^2$, sample variance, 12
$s_p^2$, pooled estimate of variance, 68
$s_{y|x}$, standard error of estimate (in
regression), 156

$s_{y|12\cdots k}$, standard error of estimate
in multiple regression, 174
Sample,  3 
mean,  10 
median,  11 
midrange,  12 
mode,  12 
random,  3 

standard  deviation,  12 
variance,  12 

Sample-size  code  letter,  214 
Sample  size  for  fixed  length  of  confidence 
interval 

for difference in means, 55
for mean, 43, 48
for proportion, 51
Sample size for fixed relative length of
confidence  interval  for  standard 
deviation,  73,  chart,  277 
in  analysis  of  variance,  111 
Sample  size  for  fixed  risks  of  error 
for  difference  in  means,  54,  57 
for  mean,  40,  46 
for  proportion,  217,  table,  255 
for  ratio  of  two  standard  deviations,  77 
for  standard  deviation,  72 
Sampling,  sequential,  220 
Sampling,  single,  double,  or  multiple,  212 
Sampling  distribution,  14 
Sandomire,  M.  M.,  277 
Scale,  change  of,  68 
Scarborough,  J.  B.,  194 
Scatter  diagram,  use  in  regression,  147 
Scheffé, Henry, 65, 208
Screening,  198,  209 
Sensitivity  testing,  91 
Sequential  analysis,  vi,  38,  220 
plan  for  proportion  defective,  221 
plan  for  standard  deviation,  224 
truncation,  222 
uses,  221 

Sheppard's  correction,  68 
Shewhart,  W.  A.,  208 
$\Sigma$, used to denote a sum, 10
$\sigma$, population standard deviation, 12
$\sigma^2$, population variance, 12
$\sigma_0$, experimental error, 111
Sign  test,  56,  table,  243 
Significance  level,  16,  41 
Significance  test,  see  Test  of 
significance 
Simon,  Leslie  E.,  64 
Simple (two-variable) linear
regression,  152 
summary  of  formulas  for,  164 
Simultaneous  linear  equations,  171 
Single,  double,  or  multiple 
sampling,  212 

Size  of  sample,  see  Sample  size 
Small  lot,  sampling  from  a,  214 
Small  samples 
charts,  270-79 
disadvantage  of,  17,  40 
quick  test  for  mean,  44 
Smith,  Babington,  35 
Snedecor,  G.  W.,  146,  194,  241 
Sprague,  A.  D.,  29,  269 
Spread  (dispersion),  measures  of,  12 
Stability  in  production,  control 
charts,  195 
Staircase  tests,  93 


Standard  deviation,  12 
comparison  of  two,  74,  chart,  276 
computation  of,  12,  68 
computed  from  coded  values,  68 
confidence  intervals,  73,  table,  242, 
chart,  277 
importance  of,  67 
of  binomial  distribution,  31 
of  Poisson  distribution,  34 
population,  12 
sample,  12 

test  of  significance,  70,  72,  224, 
chart,  275 

Standard  error  of  a  statistic,  14 
Standard  error  of  estimate 
in  multiple  regression,  174 
in  simple  regression,  156 
Standard  error  of  mean,  14 
Standard  for  production,  kept 
by  control  charts,  195 
Standardized  variable,  20,  39 
Star,  *,  to  denote  significance,  16 
Statistic,  vi 

Statistical  hypothesis,  test  of,  see 
Test  of  significance 
Statistical  Research  Group, 

Columbia  University,  107,  226 
Statistical  test,  see  Test  of 
significance 

Steps  in  controlling  quality  with 
use  of  control  charts,  197 
Steps  in  designing  and  running  an 
experiment,  113 
Sum  of  independent  variables, 
variance  of,  69 
Swed,  F.,  244 

T 

t  distribution,  table,  231 
t  test 

compared  with  factorial  design,  115 
for difference in means, 57, 59
for  mean,  45,  chart,  274 
in  regression,  147-94 
Target,  probability  of  a  hit,  25 
$\tau_1$, test for mean, 44
$\tau_d$, test for difference in means, 57,
table, 249

Test  of  hypothesis,  vii,  15 
Test  of  significance,  vii,  15 
equal-tails,  16 
example,  38 

for  mean,  see  Mean,  test  of 
significance;  for  standard 
deviation,  see  Standard  deviation, 
test  of  significance;  etc. 
one-sided, 16
Thompson, C. M., 81, 233, 235, 240
Three-factor  analysis  of  variance,  139 
Throwing  out  readings,  102,  table,  252 
Tightened  inspection,  214 
Tippett,  L.  H.  C.,  208,  248 
Tolerance  intervals,  18,  104,  table,  253 
Transformation,  standardizing,  20,  39 
Transformations  to  obtain  linearity, 

169,  183,  184 

Transformations  to  obtain  normality,  88 
Treatments,  111 

Trend,  described  by  regression,  147-94 
Trend  in  means,  62,  table,  250 
"True"  mean,  standard  deviation,  etc., 
see  Population 

Truncated sequential plan, 222
Tukey,  J.  W.,  106 
Two-factor  analysis  of  variance,  127 
Two-variable  classification  in 
analysis  of  variance,  127 
Two-variable  linear  regression,  152 
summary  of  formulas,  164 
Types of error, 16, 41
Types  of  problems  in  analysis  of 
variance,  118 

Types  of  problems  in  regression,  148 

U

U,  upper  limit  in  variables  inspection, 
209,  212 

Unbiased  estimate,  12 
Unit  of  inspection,  211 
Up-and-down  method,  93 
Upper  limit,  U,  209,  212 

V 

Variable,  standardized,  20 
Variables  testing  by  control  charts  or 
acceptance  sampling,  195,  209 


Variance,  see  Standard  deviation 
additivity,  69,  110 
analysis  of,  109-46 
estimated  from  sample,  12 
homogeneity of several variances, 78
of  a  function  of  independent 
variables,  69 
population,  12 
sample,  12 

test  of  significance,  70 
Variation 

about  the  regression  line,  156 
about  the  regression  plane,  174 
Villars,  D.  S.,  146 

W

w,  range,  13 

Wald,  Abraham,  226 
Wallis,  W.  A.,  107,  194,  205,  226,  253 
Weaver,  C.  L.,  273,  274,  275,  276 
Wild  readings,  test  for  throwing 
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America’s  Old  Masters,  James  T.  Flexner.  Four  men  emerged  unexpectedly 
from  provincial  18th  century  America  to  leadership  in  European  art:  Benjamin 
West,  J.  S.  Copley,  C.  R.  Peale,  Gilbert  Stuart.  Brilliant  coverage  of  lives  and  con¬ 
tributions.  Revised,  1967  edition.  69  plates.  365pp.  of  text. 

21806-6  Paperbound  $3.00 

First  Flowers  of  Our  Wilderness:  American  Painting,  The  Colonial 
Period,  James  T.  Flexner.  Painters,  and  regional  painting  traditions  from  earliest 
Colonial  times  up  to  the  emergence  of  Copley,  West  and  Peale  Sr.,  Foster,  Gustavus 
Hesteiius,  Feke,  John  Smibert  and  many  anonymous  painters  in  the  primitive  manner. 
Engaging  presentation,  with  162  illustrations,  xxii  +  368pp. 

22180-6  Paperbound  $3.50 

The  Light  of  Distant  Skies:  American  Painting,,  1760- 183 5,  James  T.  Flex¬ 
ner.  The  great  generation  of  early  American  painters  goes  to  Europe  to  learn  and 
to  teach:  West,  Copley,  Gilbert  Stuart  and  others.  Allston,  Trumbull,  Morse;  also 
contemporary  American  painters — primitives,  derivatives,  academics — who  remained 
in  America.  102  illustrations,  xiii  -f  306pp.  22179-2  Paperbound  $3.00 

A  History  of  thb  Rise  and  Progress  of  the  Arts  of  Design  in  the  United 
States,  William  Dunlap.  Much  the  richest  mine  of  information  on  early  American 
painters,  sculptors,  architects,  engravers,  miniaturists,  etc.  The  only  source  of  in¬ 
formation  for  scores  of  artists,  the  major  primary  source  for  many  others.  Unabridged 
reprint  of  rare  original  1834  edition,  with  new  introduction  by  James  T.  Flexner, 
and  394  new  illustrations.  Edited  by  Rita  Weiss.  6%  x  9Y8- 

21695-0, 21696-9, 21697-7 Three volumes, Paperbound $13.50

Epochs  of  Chinese  and  Japanese  Art,  Ernest  F.  Fenollosa.  From  primitive 
Chinese  art  to  the  20th  century,  thorough  history,  explanation  of  every  important  art 
period and form, including Japanese woodcuts; main stress on China and Japan, but
Tibet,  Korea  also  included.  Still  unexcelled  for  its  detailed,  rich  coverage  of  cul¬ 
tural  background,  aesthetic  elements,  diffusion  studies,  particularly  of  the  historical 
period. 2nd, 1913 edition. 242 illustrations. lii + 439pp. of text.

20364-6,  20365-4  Two  volumes,  Paperbound  $6.00 

The  Gentle  Art  of  Making  Enemies,  James  A.  M.  Whistler.  Greatest  wit  of  his 
day deflates Oscar Wilde, Ruskin, Swinburne; strikes back at inane critics, exhibitions,
art journalism; aesthetics of impressionist revolution in most striking form.
Highly  readable  classic  by  great  painter.  Reproduction  of  edition  designed  by 
Whistler. Introduction by Alfred Werner. xxxvi + 334pp.

21875-9  Paperbound  $2.50 
Planets, Stars and Galaxies: Descriptive Astronomy for Beginners, A. E.
Fanning. Comprehensive introductory survey of astronomy: the sun, solar system,
stars, galaxies, universe, cosmology; up-to-date, including quasars, radio stars, etc.
Preface  by  Prof.  Donald  Menzel.  24pp.  of  photographs.  189pp.  5V4  x  8 Vi. 

21680-2  Paperbound  $1.50 



Teach Yourself Calculus, P. Abbott. With a good background in algebra and
trig, you can teach yourself calculus with this book. Simple, straightforward
introduction to functions of all kinds, integration, differentiation, series, etc. "Students
who are beginning to study calculus method will derive great help from this book."
Faraday House Journal. 308pp. 20683-1 Clothbound $2.00

Teach  Yourself  Trigonometry,  P.  Abbott.  Geometrical  foundations,  indices  and 
logarithms,  ratios,  angles,  circular  measure,  etc.  are  presented  in  this  sound,  easy-to- 
use  text.  Excellent  for  the  beginner  or  as  a  brush  up,  this  text  carries  the  student 
through  the  solution  of  triangles.  204pp.  20682-3  Clothbound  $2.00 


Teach Yourself Anatomy, David LeVay. Accurate, inclusive, profusely illustrated
account of structure, skeleton, abdomen, muscles, nervous system, glands,
brain, reproductive organs, evolution. "Quite the best and most readable account,”
Medical Officer. 12 color plates. 164 figures. 311pp. 4^4 x 7.

21651-9  Clothbound  $2.50 

Teach Yourself Physiology, David LeVay. Anatomical, biochemical bases; digestive,
nervous, endocrine systems; metabolism; respiration; muscle; excretion;
temperature control; reproduction. "Good elementary exposition,” The Lancet. 6
color plates. 44 illustrations. 208pp. 4*4 x 7. 21658-6 Clothbound $2.50

The Friendly Stars, Martha Evans Martin. Classic has taught naked-eye observation
of stars, planets to hundreds of thousands, still not surpassed for charm, lucidity,
adequacy. Completely updated by Professor Donald H. Menzel, Harvard Observatory.
25 illustrations. 16x30 chart. x + 147pp. 21099-5 Paperbound $1.25

Music of the Spheres: The Material Universe from Atom to Quasar,
Simply Explained, Guy Murchie. Extremely broad, brilliantly written popular
account begins with the solar system and reaches to dividing line between matter and
nonmatter; latest understandings presented with exceptional clarity. Volume One:
Planets, stars, galaxies, cosmology, geology, celestial mechanics, latest astronomical
discoveries; Volume Two: Matter, atoms, waves, radiation, relativity, chemical
action, heat, nuclear energy, quantum theory, music, light, color, probability, antimatter,
antigravity, and similar topics. 319 figures. 1967 (second) edition. Total
of xx + 644pp. 21809-0, 21810-4 Two volumes, Paperbound $5.00


Old-Time  Schools  and  School  Books,  Clifton  Johnson.  Illustrations  and  rhymes 
from  early  primers,  abundant  quotations  from  early  textbooks,  many  anecdotes  of 
school  life  enliven  this  study  of  elementary  schools  from  Puritans  to  middle  19th 
century. Introduction by Carl Withers. 234 illustrations. xxxiii + 381pp.

21031-6  Paperbound  $2.50 
The Principles of Psychology, William James. The famous long course, complete
and unabridged. Stream of thought, time perception, memory, experimental
methods — these  are  only  some  of  the  concerns  of  a  work  that  was  years  ahead  of  its 
time and still valid, interesting, useful. 94 figures. Total of xviii + 1391pp.

20381-6, 20382-4 Two volumes, Paperbound $8.00


The  Strange  Story  of  the  Quantum,  Banesh  Hoffmann.  Non-mathematical 
but  thorough  explanation  of  work  of  Planck,  Einstein,  Bohr,  Pauli,  de  Broglie, 
Schrödinger, Heisenberg, Dirac, Feynman, etc. No technical background needed.
"Of books attempting such an account, this is the best,” Henry Margenau, Yale.
40-page "Postscript 1959." xii + 285pp. 20518-5 Paperbound $2.00

The  Rise  of  the  New  Physics,  A.  d’Abro.  Most  thorough  explanation  in  print 
of central core of mathematical physics, both classical and modern; from Newton to
Dirac and Heisenberg. Both history and exposition; philosophy of science, causality,
explanations  of  higher  mathematics,  analytical  mechanics,  electromagnetism,  thermo¬ 
dynamics,  phase  rule,  special  and  general  relativity,  matrices.  No  higher  mathematics 
needed  to  follow  exposition,  though  treatment  is  elementary  to  intermediate  in  level. 
Recommended to serious student who wishes verbal understanding. 97 illustrations.
xvii + 982pp. 20003-5, 20004-3 Two volumes, Paperbound $6.00

Great Ideas of Operations Research, Jagjit Singh. Easily followed non-technical
explanation of mathematical tools, aims, results: statistics, linear programming,
game  theory,  queueing  theory,  Monte  Carlo  simulation,  etc.  Uses  only  elementary 
mathematics.  Many  case  studies,  several  analyzed  in  detail.  Clarity,  breadth  make 
this excellent for specialist in another field who wishes background. 41 figures.
x + 228pp. 21886-4 Paperbound $2.50

Great Ideas of Modern Mathematics: Their Nature and Use, Jagjit Singh.
Internationally  famous  expositor,  winner  of  Unesco’s  Kalinga  Award  for  science 
popularization  explains  verbally  such  topics  as  differential  equations,  matrices, 
groups,  sets,  transformations,  mathematical  logic  and  other  important  modern 
mathematics,  as  well  as  use  in  physics,  astrophysics,  and  similar  fields.  Superb 
exposition for layman, scientist in other areas. viii + 312pp.

20587-8  Paperbound  $2.50 

Great  Ideas  in  Information  Theory,  Language  and  Cybernetics,  Jagjit 
Singh.  The  analog  and  digital  computers,  how  they  work,  how  they  are  like  and 
unlike  the  human  brain,  the  men  who  developed  them,  their  future  applications,  com¬ 
puter  terminology.  An  essential  book  for  today,  even  for  readers  with  little  math. 
Some  mathematical  demonstrations  included  for  more  advanced  readers.  118  figures. 
Tables. ix + 338pp. 21694-2 Paperbound $2.50


Chance,  Luck  and  Statistics,  Horace  C.  Levinson.  Non-mathematical  presenta¬ 
tion  of  fundamentals  of  probability  theory  and  science  of  statistics  and  their  applica¬ 
tions.  Games  of  chance,  betting  odds,  misuse  of  statistics,  normal  and  skew  distribu¬ 
tions,  birth  rates,  stock  speculation,  insurance.  Enlarged  edition.  Formerly  "The 
Science of Chance.” xiii + 357pp. 21007-3 Paperbound $2.50
Mathematical Foundations of Statistical Mechanics, A. I. Khinchin.
Introduction  to  modern  statistical  mechanics:  phase  space,  ergodic  problems,  theory 
of  probability,  central  limit  theorem,  ideal  monatomic  gas,  foundation  of  thermo¬ 
dynamics,  dispersion  and  distribution  of  sum  functions.  Provides  mathematically 
rigorous  treatment  and  excellent  analytical  tools.  Translated  by  George  Gamow. 
viii + 179pp. 60147-1 Paperbound $2.50

Introduction to Physical Statistics, Robert B. Lindsay. Elementary probability
theory, laws of thermodynamics, classical Maxwell-Boltzmann statistics,
classical statistical mechanics, quantum mechanics, other areas of physics that can
be studied statistically. Full coverage of methods; basic background theory.
ix + 306pp. 61882-X Paperbound $2.75

Dialogues  Concerning  Two  New  Sciences,  Galileo  Galilei.  Written  near  the 
end  of  Galileo’s  life  and  encompassing  30  years  of  experiment  and  thought,  these 
dialogues  deal  with  geometric  demonstrations  of  fracture  of  solid  bodies,  cohesion, 
leverage,  speed  of  light  and  sound,  pendulums,  falling  bodies,  accelerated  motion, 
etc.  Translated  by  Henry  Crew  and  Alfonso  de  Salvio.  Introduction  by  Antonio 
Favaro.  xxiii  +  300pp.  60099-8  Paperbound  $2.25 

Foundations  of  Science:  The  Philosophy  of  Theory  and  Experiment, 
Norman  R.  Campbell.  Fundamental  concepts  of  science  examined  on  middle  level: 
acceptance  of  propositions  and  axioms,  presuppositions  of  scientific  thought,  scien¬ 
tific  law,  multiplication  of  probabilities,  nature  of  experiment,  application  of  math¬ 
ematics,  measurement,  numerical  laws  and  theories,  error,  etc.  Stress  on  physics, 
but  holds  for  other  sciences.  "Unreservedly  recommended,”  Nature  (England). 
Formerly Physics: The Elements. ix + 565pp. 60372-5 Paperbound $4.00

The Phase Rule and Its Applications, Alexander Findlay, A. N. Campbell
and  N.  O.  Smith.  Findlay’s  well-known  classic,  updated  (1951).  Full  standard 
text  and  thorough  reference,  particularly  useful  for  graduate  students.  Covers 
chemical  phenomena  of  one,  two,  three,  four  and  multiple  component  systems. 
"Should  rank  as  the  standard  work  in  English  on  the  subject,”  Nature.  236  figures, 
xii  +  494pp.  60091-2  Paperbound  $3.50 

Thermodynamics,  Enrico  Fermi.  A  classic  of  modern  science.  Clear,  organized 
treatment  of  systems,  first  and  second  laws,  entropy,  thermodynamic  potentials, 
gaseous  reactions,  dilute  solutions,  entropy  constant.  No  math  beyond  calculus  is 
needed,  but  readers  are  assumed  to  be  familiar  with  fundamentals  of  thermometry, 
calorimetry. 22 illustrations. 25 problems. x + 160pp.

60361-X  Paperbound  $2.00 

Treatise  on  Thermodynamics,  Max  Planck.  Classic,  still  recognized  as  one  of 
the  best  introductions  to  thermodynamics.  Based  on  Planck’s  original  papers,  it 
presents  a  concise  and  logical  view  of  the  entire  field,  building  physical  and 
chemical laws from basic empirical facts. Planck considers fundamental definitions,
first  and  second  principles  of  thermodynamics,  and  applications  to  special  states 
of  equilibrium.  Numerous  worked  examples.  Translated  by  Alexander  Ogg.  5 
figures. xiv + 297pp. 60219-2 Paperbound $2.50
Einstein’s Theory of Relativity, Max Born. Relativity theory analyzed, explained
for intelligent layman or student with some physical, mathematical background.
Includes Lorentz, Minkowski, and others. Excellent verbal account for
teachers. Generally considered the finest non-technical account. vii + 376pp.

60769-0  Paperbound  $2.75 


Physical  Principles  of  the  Quantum  Theory,  Werner  Heisenberg.  Nobel 
Laureate  discusses  quantum  theory,  uncertainty  principle,  wave  mechanics,  work 
of  Dirac,  Schroedinger,  Compton,  Wilson,  Einstein,  etc.  Middle,  non-mathe- 
matical  level  for  physicist,  chemist  not  specializing  in  quantum;  mathematical 
appendix for specialists. Translated by C. Eckart and F. Hoyt. 19 figures.
viii + 184pp. 60113-7 Paperbound $2.00

Principles of Quantum Mechanics, William V. Houston. For student with
working knowledge of elementary mathematical physics; uses Schroedinger’s wave
mechanics.  Evidence  for  quantum  theory,  postulates  of  quantum  mechanics,  appli¬ 
cations  in  spectroscopy,  collision  problems,  electrons,  similar  topics.  21  figures. 
288pp.  60524-8  Paperbound  $3.00 

Atomic  Spectra  and  Atomic  Structure,  Gerhard  Herzberg.  One  of  the  best 
introductions  to  atomic  spectra  and  their  relationship  to  structure;  especially  suited 
to  specialists  in  other  fields  who  require  a  comprehensive  basic  knowledge.  Treat¬ 
ment  is  physical  rather  than  mathematical.  2nd  edition.  Translated  by  J.  W.  T. 
Spinks. 80 illustrations. xiv + 257pp. 60115-3 Paperbound $2.00


Atomic  Physics:  An  Atomic  Description  of  Physical  Phenomena,  Gaylord 
P. Harnwell and William E. Stephens. One of the best introductions to modern
quantum  ideas.  Emphasis  on  the  extension  of  classical  physics  into  the  realms  of 
atomic  phenomena  and  the  evolution  of  quantum  concepts.  156  problems.  173 
figures and tables. xi + 401pp. 61584-7 Paperbound $3.00


Atoms,  Molecules  and  Quanta,  Arthur  E.  Ruark  and  Harold  C.  Urey.  1964 
edition  of  work  that  has  been  a  favorite  of  students  and  teachers  for  30  years. 
Origins  and  major  experimental  data  of  quantum  theory,  development  of  concepts 
of  atomic  and  molecular  structure  prior  to  new  mechanics,  laws  and  basic  ideas 
of  quantum  mechanics,  wave  mechanics,  matrix  mechanics,  general  theory  of 
quantum  dynamics.  Very  thorough,  lucid  presentation  for  advanced  students.  230 
figures.  Total  of  xxiii  +  810pp. 

61106-X,  63107-8  Two  volumes,  Paperbound  $6.00 


Investigations on the Theory of the Brownian Movement, Albert Einstein.
Five papers (1905-1908) investigating the dynamics of Brownian motion
and  evolving  an  elementary  theory  of  interest  to  mathematicians,  chemists  and 
physical scientists. Notes by R. Fürth, the editor, discuss the history of study of
Brownian  movement,  elucidate  the  text  and  analyze  the  significance  of  the  papers. 
Translated by A. D. Cowper. 3 figures. iv + 122pp.

60304-0  Paperbound  $1.50 
Microscopy for Chemists, Harold F. Schaeffer. Thorough text; operation of
microscope, optics, photomicrographs, hot stage, polarized light, chemical procedures
for organic and inorganic reactions. 32 specific experiments cover specific
analyses: industrial, metals, other important subjects. 136 figures. 264pp.

61682-7 Paperbound $2.50
Opticks, Sir Isaac Newton. A survey of 18th-century knowledge on all aspects
of light as well as a description of Newton’s experiments with spectroscopy, colors,
lenses, reflection, refraction, theory of waves, etc. in language the layman can
follow. Foreword by Albert Einstein. Introduction by Sir Edmund Whittaker.
Preface by I. Bernard Cohen. cxxvi + 406pp. 60203-2 Paperbound $4.00

Light:  Principles  and  Experiments,  George  S.  Monk.  Thorough  coverage,  for 
student  with  background  in  physics  and  math,  of  physical  and  geometric  optics. 
Also  includes  23  experiments  on  optical  systems,  instruments,  etc.  "Probably  the 
best  intermediate  text  on  optics  in  the  English  language,”  Physics  Forum.  273 
figures. xi + 489pp. 60341-5 Paperbound $3.50
Physical Optics, Robert W. Wood. A classic in the field, this is a valuable
source  for  students  of  physical  optics  and  excellent  background  material  for  a 
study  of  electromagnetic  theory.  Partial  contents:  nature  and  rectilinear  propaga¬ 
tion  of  light,  reflection  from  plane  and  curved  surfaces,  refraction,  absorption  and 
dispersion,  origin  of  spectra,  interference,  diffraction,  polarization,  Raman  effect, 
optical properties of metals, resonance radiation and fluorescence of atoms, magneto-optics,
electro-optics, thermal radiation. 462 diagrams, 17 plates. xvi + 846pp.

61808-0  Paperbound  $4.50 

Mirrors,  Prisms  and  Lenses:  A  Textbook  of  Geometrical  Optics,  James 
P.  C.  Southall.  Introductory-level  account  of  modern  optical  instrument  theory, 
covering  unusually  wide  range:  lights  and  shadows,  reflection  of  light  and  plane 
mirrors,  refraction,  astigmatic  lenses,  compound  systems,  aperture  and  field  of 
optical system, the eye, dispersion and achromatism, rays of finite slope, the microscope,
much more. Strong emphasis on earlier, elementary portions of field, utilizing
simplest mathematics wherever possible. Problems. 329 figures. xxiv + 806pp.
61234-1 Paperbound $5.00

The  Psychology  of  Invention  in  the  Mathematical  Field,  Jacques  Hada- 
mard.  Important  French  mathematician  examines  psychological  origin  of  ideas, 
role of the unconscious, importance of visualization, etc. Based on own experiences
and reports by Dalton, Pascal, Descartes, Einstein, Poincaré, Helmholtz, etc.
xiii + 145pp. 20107-4 Paperbound $1.50


Introduction to Chemical Physics, John C. Slater. A work intended to bridge
the  gap  between  chemistry  and  physics.  Text  divided  into  three  parts:  Thermo¬ 
dynamics,  Statistical  Mechanics,  and  Kinetic  Theory;  Gases,  Liquids  and  Solids; 
and  Atoms,  Molecules  and  the  Structure  of  Matter,  which  form  the  basis  of  the 
approach.  Level  is  advanced  undergraduate  to  graduate,  but  theoretical  physics 
held to minimum. 40 tables, 118 figures. xiv + 522pp.

62562-1  Paperbound  $4.00 
Mathematical Puzzles for Beginners and Enthusiasts, Geoffrey Mott-Smith.
189  puzzles  from  easy  to  difficult — involving  arithmetic,  logic,  algebra,  properties 
of  digits,  probability,  etc. — for  enjoyment  and  mental  stimulus.  Explanation  of 
mathematical principles behind the puzzles. 135 illustrations. viii + 248pp.

20198-8  Paperbound  $1.75 

Paper Folding for Beginners, William D. Murray and Francis J. Rigney. Easiest
book  on  the  market,  clearest  instructions  on  making  interesting,  beautiful  origami. 
Sail  boats,  cups,  roosters,  frogs  that  move  legs,  bonbon  boxes,  standing  birds,  etc. 
40  projects;  more  than  275  diagrams  and  photographs.  94pp. 

20713-7  Paperbound  $1.00 

Tricks  and  Games  on  the  Pool  Table,  Fred  Herrmann.  79  tricks  and  games— 
some  solitaires,  some  for  two  or  more  players,  some  competitive  games — to  entertain 
you  between  formal  games.  Mystifying  shots  and  throws,  unusual  caroms,  tricks 
involving such props as cork, coins, a hat, etc. Formerly Fun on the Pool Table.
77  figures.  95pp.  21814-7  Paperbound  $1.00 

Hand  Shadows  to  be  Thrown  Upon  the  Wall:  A  Series  of  Novel  and 
Amusing Figures Formed by the Hand, Henry Bursill. Delightful picturebook
from  great-grandfather’s  day  shows  how  to  make  18  different  hand  shadows:  a  bird 
that  flies,  duck  that  quacks,  dog  that  wags  his  tail,  camel,  goose,  deer,  boy,  turtle, 
etc. Only book of its sort. vi + 33pp. 6V2 x 9V^. 21779-5 Paperbound $1.00

Whittling and Woodcarving, E. J. Tangerman. 18th printing of best book on
market.  "If  you  can  cut  a  potato  you  can  carve”  toys  and  puzzles,  chains,  chessmen, 
caricatures,  masks,  frames,  woodcut  blocks,  surface  patterns,  much  more.  Information 
on  tools,  woods,  techniques.  Also  goes  into  serious  wood  sculpture  from  Middle 
Ages to present, East and West. 464 photos, figures. x + 293pp.

20965-2  Paperbound  $2.00 

History  of  Philosophy,  Julian  Marias.  Possibly  the  clearest,  most  easily  followed, 
best planned, most useful one-volume history of philosophy on the market; neither
skimpy  nor  overfull.  Full  details  on  system  of  every  major  philosopher  and  dozens 
of  less  important  thinkers  from  pre-Socratics  up  to  Existentialism  and  later.  Strong 
on  many  European  figures  usually  omitted.  Has  gone  through  dozens  of  editions  in 
Europe.  1966  edition,  translated  by  Stanley  Appelbaum  and  Clarence  Strowbridge. 
xviii + 505pp. 21739-6 Paperbound $3.00

Yoga: A Scientific Evaluation, Kovoor T. Behanan. Scientific but non-technical
study  of  physiological  results  of  yoga  exercises;  done  under  auspices  of  Yale  U. 
Relations to Indian thought, to psychoanalysis, etc. 16 photos. xxiii + 270pp.

20505-3  Paperbound  $2.50 


Prices  subject  to  change  without  notice. 

Available at your book dealer or write for free catalogue to Dept. GI, Dover
Publications,  Inc.,  180  Varick  St.,  N.  Y.,  N.  Y.  10014.  Dover  publishes  more  than 
150  books  each  year  on  science,  elementary  and  advanced  mathematics,  biology, 
music,  art,  literary  history,  social  sciences  and  other  areas. 
The Advanced Geometry of Plane Curves and Their Applications,
C. Zwikker. (61078-0) $2.50
Trigonometrical  Series,  Antoni  Zygmund.  (60290-7)  $2.50 
Probability  Theory,  A.  R.  Arthurs.  (61724-6)  $1.25 
Solutions  of  Laplace’s  Equations,  D.  R.  Bland.  (61452-2)  $1.25 
Vibrating Strings, D. R. Bland. (61451-4) $1.25
Vibrating  Systems,  R.  F.  Chisnell.  (61453-0)  $1.25 
Linear  Equations,  P.  M.  Cohn.  (61455-7)  $1.25 
Solid  Geometry,  P.  M.  Cohn.  (61454-9)  $1.25 
Principles of Dynamics, M. B. Glauert. (61456-5) $1.25
Sequences  and  Series,  J.  A.  Green.  (61457-3)  $1.25 
Sets  and  Groups,  J.  A.  Green.  (61458-1)  $1.25 
Differential Calculus, P. J. Hilton. (61459-X) $1.25
Partial  Derivatives,  P.  J.  Hilton.  (61460-3)  $1.25 
Electrical  and  Mechanical  Oscillations,  D.  S.  Jones.  (61461-1) 
$1.25 

Complex  Numbers,  W.  Ledermann.  (61462-X)  $1.25 
Integral  Calculus,  W.  Ledermann.  (61463-8)  $1.25 
Multiple  Integrals,  W.  Ledermann.  (61723-8)  $1.25 
Numerical  Approximation,  B.  R.  Morton.  (61464-6)  $1.25 
Elementary  Differential  Equations  and  Operators,  G.  E.  H. 
Reuter.  (61465-4)  $1.25 

Fourier  and  Laplace  Transforms,  Peter  D.  Robinson.  (62083-2) 
$1.25 

Fourier  Series,  I.  N.  Sneddon.  (61466-2)  $1.25 
Differential  Geometry,  K.  L.  Wardle.  (61467-0)  $1.25 
Statistics Manual

Edwin L. Crow, Frances A. Davis, Margaret W. Maxfield

This manual was prepared under the auspices of the U.S. Naval Ordnance Test Station,
China Lake, California, to provide a comprehensive collection of classical and modern
methods of making statistical inferences. The many examples taken from ordnance
development will prove valuable to workers in all fields which use statistics, including
the social sciences, as well as the physical sciences and engineering. Formulas,
explanations, and methods of application are given, with emphasis throughout on use.
The reader is assumed to have a basic knowledge of statistics.

Information is given on fiducial limits, power of a test, determination of sample size,
sign tests, relations of means and variances, Chi-Square, runs, transformations for
obtaining normality, sensitivity, analysis of variance, tests for homogeneity of variances,
linear, multiple, and non-linear regression, correlation, reliability, confidence intervals
for predicted values, quality control, measures of central tendency, measures of dispersion,
various distributions, and many other areas and topics.

An appendix of 32 valuable tables and charts is included, several of which are difficult
to find elsewhere. They indicate critical values for the M distribution for testing the
null hypothesis that k populations have the same variance; critical values for runs;
critical values for tests using the range; probabilities of shots falling in certain
2-dimensional places, confidence belts for proportions, OC curves, etc.

"Well  written  .  .  .  excellent  reference  work,"  Operations  Research. 

Unabridged,  corrected  reprinting  of  NAVORD  Report  3369  NOTS  948,  "Statistics  Manual." 
Introduction. Appendix of 21 tables, 11 charts. Index. Bibliography. 95 illustrations,
charts, tables in text. xvii + 288pp. x 8. 60599X Paperbound